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CHAPTER 0 


Introduction: What Is Logic? 


Mathematical logic is the study of mathematical reasoning. We do this by 
developing an abstract model of the process of reasoning in mathematics. We then 
study this model and determine some of its properties. 

Mathematical reasoning is deductive; that is, it consists of drawing (correct) 
inferences from given or already established facts. Thus the basic concept is that 
of a statement being a logical consequence of some collection of statements. In 
ordinary mathematical English the use of “therefore” customarily means that the 
statement following it is a logical consequence of what comes before. 


Every integer is either even or odd; 7 is not even; therefore 7 is 
odd. 


In our model of mathematical reasoning we will need to precisely define logical 
consequence. To motivate our definition let us examine the everyday notion. When 
we say that a statement o is a logical consequence of (“follows from”) some other 
statements 61,...,8,, we mean, at the very least, that o is true provided 0),..., On 
are all true. 

Unfortunately, this does not capture the essence of logical consequence. For 
example, consider the following: 


Some integers are odd; some integers are prime; therefore some 
integers are both odd and prime. 


Here the hypotheses are both true and the conclusion is true, but the reasoning 
is not correct. 

The problem is that for the reasoning to be logically correct it cannot depend 
on properties of odd or prime integers other than what is explicitly stated. Thus 
the reasoning would remain correct if odd, prime, and integer were changed to 
something else. But in the above example if we replaced prime by even we would 
have true hypotheses but a false conclusion. This shows that the reasoning is false, 
even in the original version in which the conclusion was true. 

The key observation here is that in deciding whether a specific piece of rea- 
soning is or is not correct we must consider alMathematical logic is the study of 
mathematical reasoning. We do this by developing an abstract model of the process 
of reasoning in mathematics. We then study this model and determine some of its 
properties. 

Mathematical reasoning is deductive; that is, it consists of drawing (correct) 
inferences from given or already established facts. Thus the basic concept is that of a 
statement being a logical consequence of some collection of statements. In ordinary 
mathematical English the use of “therefore” customarily means that the statement 
following it is a logical consequence of what 1 ways of interpreting the undefined 
concepts—integer, odd, and prime in the above example. This is conceptually easier 


al 
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in a formal language in which the basic concepts are represented by symbols (like 
P, Q) without any standard or intuitive meanings to mislead one. 
Thus the fundamental building blocks of our model are the following: 


(1) a formal language CL, 

(2) sentences of £L: 0,0,..., 

(3) interpretations for £: 2,%,..., 

(4) arelation / between interpretations for £ and sentences of £, with 2 - a 
read as “a is true in the interpretation 2,” or “2l is a model of o.” 


Using these we can define logical consequence as follows: 


DEFINITION -1.1. Let T = {61,...,0,} where 0),...,0, are sentences of £, and 
let o be a sentence of £. Then a is a logical consequence of T if and only if for 
every interpretation 2 of £L, 2% o provided 2 | @; for alli =1,...,n. 


Our notation for logical consequence is  — o. 

In particular note that T 4 o, that is, o is not a logical consequence of I, if 
and only if there is some interpretation 2 of £ such that 2 6; for all 0; € T but 
Ao, Ais not a model of oc. 

As a special limiting case note that @ Eo, which we will write simply as - o, 
means that 2 | o for every interpretation 2% of £. Such a sentence go is said to be 
logically true (or valid). 

How would one actually show that [ — o for specific [ and a? There will 
be infinitely many different interpretations for £ so it is not feasible to check each 
one in turn, and for that matter it may not be possible to decide whether a par- 
ticular sentence is or is not true on a particular structure. Here is where another 
fundamental building block comes in, namely the formal analogue of mathematical 
proofs. A proof of o from a set T’ of hypotheses is a finite sequence of statements 
00,---,0% Where o is ox and each statement in the sequence is justified by some 
explicitly stated rule which guarantees that it is a logical consequence of I’ and the 
preceding statements. The point of requiring use only of rules which are explicitly 
stated and given in advance is that one should be able to check whether or not a 
given sequence o9,...,0% is a proof of o from LT. 

The notation + o will mean that there is a formal proof (also called a deduc- 
tion or derivation) of o from T. Of course this notion only becomes precise when 
we actually give the rules allowed. 

Provided the rules are correctly chosen, we will have the implication 


ifltothenT -oa. 


Obviously we want to know that our rules are adequate to derive all logical 
consequences. That is the content of the following fundamental result: 


THEOREM -1.1 (Completeness Theorem (K. Gédel)). For sentences of a first- 
order language L, we have to if and only if &o. 


First-order languages are the most widely studied in modern mathematical 
logic, largely to obtain the benefit of the Completeness Theorem and its applica- 
tions. In these notes we will study first-order languages almost exclusively. 

Part ?? is devoted to the detailed construction of our “model of reasoning” for 
first-order languages. It culminates in the proof of the Completeness Theorem and 
derivation of some of its consequences. 
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Part ?? is an introduction to Model Theory. If I is a set of sentences of L£, 
then Mod(T), the class of all models of T, is the class of all interpretations of £ 
which make all sentences in I true. Model Theory discusses the properties such 
classes of interpretations have. One important result of model theory for first-order 
languages is the Compactness Theorem, which states that if Mod(I) = 0 then there 
must be some finite [9 C T with Mod(Io) = 0. 

Part ?? discusses the famous incompleteness and undecidability results of G’odel, 
Church, Tarski, et al. The fundamental problem here (the decision problem) is 
whether there is an effective procedure to decide whether or not a sentence is logi- 
cally true. The Completeness Theorem does not automatically yield such a method. 

Part ?? discusses topics from the abstract theory of computable functions (Re- 
cursion Theory). 


Part 1 


Elementary Logic 


CHAPTER 1 


Sentential Logic 


0. Introduction 


Our goal, as explained in Chapter 0, is to define a class of formal languages 
whose sentences include formalizations of the sttements commonly used in math- 
ematics and whose interpretatins include the usual mathematical structures. The 
details of this become quite intricate, which obscures the “big picture.” We there- 
fore first consider a much simpler situation and carry out our program in this 
simpler context. The outline remains the same, and we will use some of the same 
ideas and techniques-especially the interplay of definition by recursion and proof 
by induction—when we come to first-order languages. 

This simpler formal language is called sentential logic. In this system, we ignore 
the “internal” structure of sentences. Instead we imagine ourselves as given some 
collection of sentences and analyse how “compound” sentences are built up from 
them. We first see how this is done in English. 

If A and B are (English) sentences then so are “A and B”, “A or B”, “A implies 
B”, “if A then B”, “A iff B”, and the sentences which assert the opposite of A and 
B obtained by appropriately inserting “not” but which we will express as “not A” 
and “not B”. 

Other ways of connecting sentences in English, such as “A but B” or “A unless 
B”, turn out to be superfluous for our purposes. In addition, we will consider 
“A implies B” and “if A then B” to be the same, so only one will be included in 
our formal system. In fact, as we will see, we could get by without all five of the 
remaining connectives. One important point to notice is that these constructions 
can be repeated ad infinitum, thus obtaining (for example): 

“if (A and B) then (A implies B)”, 
“A and (B or C)”, 
“(A and B) or C”. 

We have improved on ordinary English usage by inserting parentheses to make 
the resulting sentences unambiguous. 

Another important point to note is that the sentences constructed are longer 
than their component parts. This will have important consequences in our formal 
system. 

In place of the English language connectives used above, we will use the fol- 
lowing symbols, called sentential connectives. 
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English word | Symbol Name 
and A conjunction 
or Vv disjunction 
implies > implication 
iff o biconditional 
not a negation 


1. Sentences of Sentential Logic 


To specify a formal language £, we must first specify the set of symbols of L. 
The expressions of Care then just the finite sequences of symbols of £. Certain 
distinguished subsets of the set of expressions are then defined which are studied 
because they are “meaningful” once the language is intepreted. The rules deter- 
mining the various classes of meaningful expressions are sometimes referred to as 
the syntax of the language. 

The length of an expression a, denoted lh(q), is the length of a as a sequence 
of symbols. Expressions a@ and ( are equal, denoted by a = £, if and only if a 
and £ are precisely the same sequence-that is, they have the same length and for 
each i the i*” term of a is the same symbol as the i” term of 6. We normally 
write the sequence whose successive terms are €9,€1,...,En AS ENE1---En. This is 
unambiguous provided no symbol is a finite sequence of other symbols, which we 
henceforth tacitly assume. 

In the formal language S for sentential logic, we will need symbols (infinitely 
many) for the sentences we imagine ourselves as being given to start with. We 
will also need symbols for the connectives discussed in the previous section and 
parentheses for grouping. The only “meaningful” class of expressions of S we will 
consider is the set of sentences, which will essentially be those expressions built up 
in the way indicated in the previous section. 

Thus we proceed as follows. 


DEFINITION 1.1. The symbols of the formal system S comprise the following: 
1) aset of sentence symbols: So, S1,...,Sn,.-. for alln €w 
2) the sentential connectives: A,V,3,0 
3) parentheses: (, ) 
We emphasize that any finite sequence of symbols of S is an expression of S. 
For example: 
))(S175 
is an expression of length 6. 
DEFINITION 1.2. The set Sn of sentences of S is defined as follows: 
1) S, € Sn for alln €w 
2) if ¢€ Sn then (Ad) € Sn 
3) if d,~ € Sn then (¢* w) € Sn where x is one of A, V, >, 
4) nothing else is in Sn 


To show that some expression is a sentence of S we can explicitly exhibit each 
step it its construction according to the definition. Thus 


((S3 \ (=S;)) > S4) E Sn 
since it is constructed as follows: 
S4, 51, (791), 93, (S3 A (7$1)), ((S3 A (791)) > Sa). 
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Such a sequence exhibiting the formation of a sentence is called a history of the 
sentence. In general, a history is not unique since the ordering of (some) sentences 
in the sequence could be changed. 

The fourth clause in the definition is really implicit in the rest of the definition. 
We put it in here to emphasize its essential role in determining properties of the 
set Sn. Thus it implies (for example) that every sentence satisfies one of clauses 
1), 2), or 3). For example, if o € Sn and lh(c) > 1 then o begins with ( and ends 
with ). So 4S,7 ¢ Sn. Similarly, (~S177) ¢ Sin since if it were it would necessarily 
be (=¢) for some ¢ € Sn; this can only happen if ¢ = S177, and S177 ¢ Sn since 
it has length greater than 1, but has no parentheses. 

The set Sin of sentences was defined as the closure of some explicitly given set 
(here the set of all sentence symbols) under certain operations (here the operations 
on expressions which lead from a, to (a A 8), etc.). Such a definition is called 
a definition by recursion. Note also that in this definition the operations produce 
longer expressions. This has the important consequence that we can prove things 
about sentences by induction on their length. Our first theorem gives an elegant 
form of induction which has the advantage (or drawback, depending on your point 
of view) of obscuring the connection with length. 


THEOREM 1.1. Let X C Sn and assume that (a) S, € X for alln € w, and (b) 
if 6,0 € X then (7d) and (6x Ww) belong to X for each binary connective x. Then 
X=Sn. 


PRroor. Suppose X # Sn. Then Y = (Sn— X) # 0. Let 0) € Y be such 
that [h(@0) < lh(@) for every 6 € Y. Then 6) 4 S,, for any n € w, by (a), hence 
00 = (7¢) or 09 = (@* w) for sentences ¢ and ~ and some connective x. But then 
Ih(@), lh(a) < lh(09) so by choice of 09, we have ¢,w € Y, ie. ¢,w € X. But then 
(b) implies that 69 € X, a contradiction. 


As a simple application we have the following. 


COROLLARY 1.2. A sentence contains the same number of left and right paren- 
theses. 


PrRooF. Let p;(a@) be the number of left parentheses in a a and let p,(a) be 
the number of right parentheses in a. Let X = {@ € Sn| pi(0) = p-(0)}. Then 
S, € X for all n € w since pi(Sn) = pp(Sn) = 0. Further, if ¢ € X then (nd) € X 
since pi((-¢)) = 1+ pi(¢), Pr((>¢)) = 1+ pr(d), and pi($) = pr(G) since 6 € X 
(i.e. “by inductive hypothesis”). The binary connectives are handled similarly and 
so X = Sn. 


The reason for using parentheses is to avoid ambiguity. We wish to prove that 
we have succeeded. First of all, what—in this abstract context—would be considered 
an ambiguity? If our language had no parentheses but were otherwise unchanged 
then —=S 9 A S$; would be considered a “sentence.” But there are two distinct ways 
to add parentheses to make this into a real sentence of our formal system, namely 
((-59) A Sy) and (=(S9 A S1)). In the first case it would have the form (a A £) 
and in the second the form (7a). Similarly, So > $1; — 52 could be made into 
either of the sentences ((So9 > $1) > S2) or (So — ($1 — $2)). Each of these has 
the form (@ > £), but for different choices of ~ and 8. What we mean by lack 
of ambiguity is that no such “double entendre” is possible, that we have instead 
unique readability for sentences. 
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THEOREM 1.3. Every sentence of length greater than one has exactly one of the 


forms: (7d), (Vv), (PAW), (é > Ww), (6 & W) for exactly one choice of sentences 
¢, (or ¢ alone in the first form). 


This result will be proved using the following lemma, whose proof is left to the 
reader. 


LEMMA 1.4. No proper initial segment of a sentence is a sentence. (By a 
proper initial segment of a sequence €9€1 ...En—1 18 meant a sequence EpE1.--Em-—1; 
consisting of the first m terms for some m <n). 


ProoF. (of the Theorem from the Lemma) Every sentence of length greater 
than one has at least one of these forms, so we need only consider uniqueness. 
Suppose @ is a sentence and we have 


6 = (ax B) = (a’ x’ f’) 
for some binary connectives x, *’ and some sentences a, 3, a’, 3’. We show that a = 
a’, from which it follows that * = «’ and 6 = f’. First note that if Ih(a) = Ih(a’) 
then a = a’ (explain!). If, say, lh(a@) < h(a’) then a is a proper initial segment of 
a’, contradicting the Lemma. Thus the only possibility is a = a’. We leave to the 
reader the easy task of checking when one of the forms is (=¢). 


We in fact have more parentheses than absolutely needed for unique readability. 
The reader should check that we could delete parentheses around negations—thus 
allowing —¢@ to be a sentence whenever ¢ is—and still have unique readability. In 
fact, we could erase all right parentheses entirely-thus allowing (6A wv, (@V wy, ete. 
to be sentences whenever ¢, 7 are—and still maintain unique readability. 

In practice, an abundance of parentheses detracts from readability. We there- 
fore introduce some conventions which allow us to omit some parentheses when 
writing sentences. First of all, we will omit the outermost pair of parentheses, thus 
writing -¢ or ¢A w in place of (=) or (6A w). Second we will omit the parenthe- 
ses around negations even when forming further sentences—for example instead of 
(359) A 51, we will normally write just —S 5 AS ;. This convention does not cuase 
any ambiguity in practice because (=(S9 A S;)) will be written as =(59 AS). The 
informal rule is that negation applies to as little as possible. 

Building up sentences is not really a linear process. When forming (¢ > w), 
for example, we need to have both ¢ and wW but the order in which they appear in 
a history of (¢ > vy) is irrelevant. One can represent the formation of (¢ > w) 
uniquely in a two-dimensional fashion as follows: 

TEE 


By iterating this process until sentence symbols are reached one obtains a 
tree representation of any sentence. This representation is unique and graphically 
represents the way in which the sentence is constructed. 

For example the sentence 


((S7 A (S4 > (>S0))) + (>(S3 A (So > S2)))) 


is represented by the following tree: 
TUE Eee 


We have one final convention in writing sentences more readably. It is seldom 
important whether a sentence uses the sentence symbols Sp, $13, and S7 or S93, S¢, 
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and Sj7. We will use A, B, C,... (perhaps with sub- or superscripts) as variables 
standing for arbitrary sentence symbols (assumed distinct unless explicitly noted 
to the contrary). Thus we will normally refer to A + (B — C), for example, rather 
than So ar (Si7 oad Si3). 


2. Truth Assignments 


An interpretation of a formal language must, at a minimum, determine which 
of the sentences of £are true and which are false. For sentential logic this is all that 
could be expected. So an interpretation for S could be identified with a function 
mapping Sn into the two element set {T, F'}, where T stands for “true” and F for 
“false.” 

Not every such function can be associated with an interpretation of S, however, 
since a real interpretation must agree with the intuitive (or, better, the intended) 
meanings of the connectives. Thus (=@) should be true iff ¢ is false and (¢ A w) 
shuld be true iff both ¢ and w are true. We adopt the inclusive interpretation of 
“or” and therefore say that (¢ V w) is true if either (or both) of ¢,w is true. We 
consider the implication (6 > ~) as meaning that w is true provided ¢ is true, 
and therefore we say that (¢ + w) is true unless ¢ is true and w is false. The 
biconditional (¢ 4 w) will thus be true iff ¢, W are both true or both false. 

We thus make the following definition. 


DEFINITION 2.1. An interpretation for S is a function t: Sn > {T, F} satisfy- 
ing the following conditions for all ¢,w € Sn: 
(i) t((4¢)) =T iff ¢(¢) = F, 
(ii) t((PAY)) =T iff tg) = ty) =T, 
(iii) t((@V w)) =T iff t(¢) = T or t(w) = T (or both), 
(iv) t((¢> W)) =F iff t(6) =T and t(w) = F, and 
(v) (6 4) iff t(4) = HCW). 


How would one specify an interpretation in practice? The key is the following 
lemma, which is easily established by induction. 


LEMMA 2.1. Assume t and t' are both interpretations for S and that t(S,) = 
U(S,) for alln €w. Then t(c) =t'(o) for alla € Sn. 


So an interpretation is determined completely once we know its values on the 
sentence symbols. One more piece of terminology is useful. 


DEFINITION 2.2. A truth assignment is a function h: {S,| n € w} > {T, F}. 


A truth assignment, then, can be extended to at most one interpretation. The 
obvious question is whether every truth assignment can be extended to an inter- 
pretation. 

Given a truth assignment h, let’s see how we could try to extend it to an 
interpretation t. Let o € Sn and let ¢o,...,¢, be a history of a (so d, =a). We 
then can define t on each ¢;,0 <i <n, one step at a time, using the requirements 
in the definition of an interpretation; at the last step we will have defined t(c). 
Doing this for every 0 € Sn we end up with what should be an interpretation t. 
The only way this could go wrong is if, in considering different histories, we were 
forced to assign different truth values to the same sentence ¢. But this could only 
happen through a failure of unique readability. 
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This argument can be formalized to yield a proof of the remaining half of the 
following result. 


THEOREM 2.2. Every truth assignment can be extended to exactly one inter- 
pretation. 


ProoF. Let h be a truth assignment. We outline how to show that h can be 

extended to an interpretation t. The main fact to establish is: 
(*) assume that h;,(S,) = h(S,,) for all n € w and hy : {o € 
Sn| lh(o) < k} > {T,F} satisfies (i)-(v) in the definition of 
an interpretation for sentences in its domain; then hz can be 
extended to hy+41 defined on {o € Sn| lh(c) < k+1} and which 
also satisfies (i)-(v) in the definition of an interpretation for all 
sentences in its domain. 


Using this to define a chain 


h=hy Che C... Chey... 


and we see that t = U{h,| & € w} is an interpretation, as desired. 


In filling in the details of this argument the reader should be especially careful 
to see exactly where unique readability is used. 


DEFINITION 2.3. For any truth assignment h its unique extension to an inter- 
preteation is denoted by h. 


Given h and o we can actually compute h(c) by successively computing h(¢;) 
for each sentence ¢; in a history ¢9,...,¢@, of o. Thus if h(S,,) = F for alln € w we 
successively see that h(S$4) = F, h($,) = F, h(-S,) =T, h(S3) = F, h($3 $1) = 
F, and finally h(($3 \.$,) + S4) = T. This process is particularly easy if o is given 
in tree form—h tells you how to assign T, F' to the sentence symbols at the base of 
the tree, and (i)-(v) of the definition of an interpretation tell you how to move up 
the tree, node by node. 

There are many situations in which we are given some function f defined on the 
sentence symbols and want to extend it to all sentences satisfying certain conditions 
relating the values at (=¢), (6A W), etc. to its values at ¢,w. Minor variations in 
the argument for extending truth assignments to interpretations establish that this 
can always be done. The resulting function is said to be defined by recursion , on 
the class of sentences. 


THEOREM 2.3. Let X be any set, and let g.:X > X andg,:X x X + X be 
given for each binary connective x. Let f : {Sn|n € w} + X be arbitrary. Then 
there is exactly one function f : Sn — X such that 


f(Sn) = f(Sn) for alln € wu, 


f(>6) = g(F(@)) for all ¢ € Sin, 


f(é«wv) = 9.(f(¢), FW)) for all 6, € Sn and binary connectives x. 
Even when we have an informal definition of a function on the set Sn, it 
frequently is necessary to give a precise definition by recursion in order to study 
the properties of the function. 


EXAMPLE 2.1. Let X =w, f(S,) = 0 for all n € w. Extend f to f on Sn via 
he recursion clauses 
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F((49)) = F@) +1 _ 

f((@*wv)) = f(¢) + f() + 1 for binary connectives x. 
We can then interpret f(@) as giving any of the following: 

the number of left parentheses in 6, 

the number of right parentheses in 0, 

the number of connectives in @. 


EXAMPLE 2.2. Let ¢o be some fixed sentence. We wish to define f so that f(0) 
is the result of replacing Sp throughout @ by ¢9. This is accomplished by recursion, 
by starting with f given by 


_ Po, n=0 
f&={ 9 225 
and extending via the recursion clauses 


f((46)) = (F(¢)), 


f((@xw)) = (f(¢) x f(W)) for binary connectives «. 


For the function f of the previous example, we note the following fact, estab- 
lished by induction. 


LEMMA 2.4. Given any truth assignment h define h* by 


* = h(d0), n=0 
h(Sn) = { h(S,), n£0 
Thus for any sentence 0 we have h*(0) = h(f(0)). 
Proor. By definition of h* and f we see that h*(S,) = h(f(Sn)) for all n. 


The recursion clauses yielding f guarantees that this property is preserved under 
forming longer sentences. 


Note that the essential part in proving that a sentence has the same number 
of left parentheses as right parentheses was noting, as in Example 1.3.1, that these 
two functions satisfied the same recursion clauses. 

As is common in mathematical practice, we will frequently not distinguish 
notationally between f and f. Thus we will speak of defining f by recursion given 
the operation of f on {S,| n € w} and certain recursion clauses involving f. 


3. Logical Consequence 


Since we now know that every truth assignment h extends to a unique in- 
terpretation, we follow the outline established in the Introduction using as our 
fundamental notion the truth of a sentence under a truth assignment. 


DEFINITION 3.1. Let h be a truth assignment and @ € Sn. Then @ is true 
under h, written h — 0, iff h(@) = T where h is the unique extension of h to an 
interpretation. 


_ Thus @ is not true under h, written h |- 0, iff h(0) AT. Thush 6 iff 
h(@) = F iffh & 76. 
We will also use the following terminology: h satisfies 6 iff h — @. 


DEFINITION 3.2. A sentence @ is satisfiable iff it is satisfied by some truth 
assignment h. 
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We extend the terminology and notation to sets of sentences in the expected 
way. 


DEFINITION 3.3. Let h be a truth assignment and © C Sn. Then © is true 
under h, or h satisfies 4, written h — &, iff h E o for every o € b. 


DEFINITION 3.4. A set © of sentences is satisfiable iff it is satisfied by some 
truth assignment h. 


The definitions of logical consequence and (logical) validity now are exactly as 
given in the Introduction. 


DEFINITION 3.5. Let 6 € Sn and & C Sn. Then @ is a logical consequence of 
» written © — 0, iff h & @ for every truth assignment h which satisfies U. 


DEFINITION 3.6. A sentence 6 is (logically) valid, or a tautology, iff 0 — @, i.e. 
h — @ for every truth assignment h. 


It is customary to use the word “tautology” in the context of sentential logic, 
and reserve “valid” for the corresponding notion in first order logic. Our notation 
in any case will be — @, rather than @ —& @. 

The following lemma, translating these notions into satisfiability, is useful and 
immediate from the definitions. 


LEMMA 3.1. (a) 0 is a tautology iff 40 is not satisfiable. (b) & E @ iff UU{-6} 


is not satisfiable. 


Although there are infinitely many (indeed uncountably many) different truth 
assignments, the process of checking validity or satisfiability is much simpler bec- 
dause only finitely many sentence symbols occur in any one sentence. 


LEMMA 3.2. Let 6 € Sn and let h,h* be truth assignments such that h(S,) = 
h*(S,,) for all S, in @. Then h(0) = h*(0), and thush — 0 iff h* 0. 


ProoF. Let Aj,..., An be sentence symbols, and let h, h* be truth assignments 
so that h(A;) = h*(A;) for all i = 1,...,n. We show by induction that for every 


6 € Sn, h(9) = h*(@) provided @ uses no sentence symbols other than A1,...,An. 
The details are straightforward. 


This yields a finite, effective process for checking validity and satisfiability of 
sentences, and also logical consequences of finite sets of sentences. 


THEOREM 3.3. Let Aj,...,An be sentence symbols. Then one can find a finite 
list hi,..., hm of truth assignments such that for every sentence @ using no sentence 
symbols other than Aj,...,A, we have: (a) — @ iff h; - @ for all j =1,...,m, 
and (b) @ is satisfiable iff hh; — 0 for some 7,1 <j <m. If further X is a set of 
sentences using no sentence symbols other than Aj,...,An then we also have: (c) 
LE #6 iffh; — 6 whenever h; — X, for each j =1,...,m. 


PRooF. Given Aj,...,An we let hy,...,hm list all truth assignments h such 
that h(S;,) = F for every S;, different from A1,...,A,. There are exactly m = 2” 
such, and they work by the preceding lemma. 


The information needed to check whether or not a sentence @ in the sentence 
symbols A,,..., A, is a tautology is conveniently represented in a table. Across the 
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top of the table one puts a history of 0, beginning with A;,...,A,, and each line 
of the table corresponds to a different assignment of truth values to A,,..., An. 

For example, the following truth table shows that (S3 A 45)) — $4 is not a 
tautology. 


Sy} S3 S4 aS} S3 A aS} (S3 /\ =) —> S4 
T | T|T F F T 

T/T ]F F F 

T| FT F F T 

T| FF F F T 

F/T] T T T T 

F/T |F T T F 

F/) FT T F T 

F/ FF T F T 


Writing down truth tables quickly becomes tedious. Frequently shortcuts are 
possible to reduce the drudgery. For example, if the question is to determine 
whether or not some sentence @ is a tautology, suppose that h(@) = F and work 
backwards to see what h must be. To use the preceding example, we see that 


h(($3 x 7S) => S4) =F 
iff h((S3 A 7$,)) =T and h(S4) =F 


and h(($3 A -=S1)) =T 
iff h(S,) = f and h(S3) = T. 
Thus this sentence is not a tautology since it is false for every h such that h(.S1) = F, 
h($3) = T, and h(S4) = F. 

As another example, consider 6 = (A > B) > ((34 > B) > B). Then h(6) = 
F iff h(A > B) =T and h((-A > B) > B= F. And h((-A > B) > B)=F 
iff h(1A + B) = T and h(B) = F. Now for h(B) = F we have h(A > B) = T iff 
h(A) = F and h(=A > B) = T iff h(A) = T. Since we can’t have both h(A) = T 
and h(a) = F we may conclude that @ is a tautology. 

Some care is needed in such arguments to ensure that the conditions obtained 
on h at the end are actually equivalent to h(). Otherwise some relevant truth 
assignment may have escaped notice. Of course only the implications in one direc- 
tion are needed to conclude @ is a tautology, and only the implications in the other 
direction to conclude that such an h actually falsifies 6. But until you know which 
conclusion holds, both implications need to be preserved. 

An analogous process, except starting with the supposition h(@) = T, can 
be used to determine the satisfiability of 0. If © is the finite set {o1,...,0,%} of 
sentences then one can check whether or not © — 6 by supposing h(0) = F while 
h(o;) =T for alli =1,...,k and working backwards from these hypotheses. 

An important variation on logical consequence is given by logical equivalence. 


DEFINITION 3.7. Sentences ¢,W are logically equivalent, written ¢@ FH yw, iff 
{o} Fv and {y} & ¢. 


Thus, logically equivalent sentences are satisfied by precisely the same truth 
assignments, and we will think of them as making the same assertion in different 
ways. 

Some examples of particular interest to us invole writing one connective in 
terms of another. 


16 1. SENTENTIAL LOGIC 


LEMMA 3.4. For any ¢,w € Sn we have: 


(a) (6>%) (46 Vy) 

(b) (vp) FA (o> ¥) 

() (@VP) FA aCe AnH) 

(4) (PAP) FA Ae V 7H) 

(e) (@A¥Y) FA 76> 79) 

f) oy) FA @>2HWAW> 9) 


What we want to conclude, using parts (b), (e), and (f) of the above lemma is 
that every sentence 6 is logically equivalent to a sentence 6* using the same sentence 
symbols but only the connectives =,—. This is indeed true, and we outline the steps 
needed to prove —6. 

First of all, we must define (by recursion) the operation * on sentences described 
by saying that 6* results from 6 by replacing subexpressions (Vw), (PAW), (6 & W) 
of 6 (for sentences ¢, yw) by their equivalents in terms of 4, given in the lemma. 

Secondly, we must prove (by induction) that for every truth assignment h and 
every 0 € Sin we have h(0) = h(6*). 

Details of this, and similar substitution facts, are left to the reader. 

Due to the equivalence (6Vw) VO FA dV (WV 8) and (bAW) AD FA dA(WWAB), 
we will omit the parentheses used for grouping conjunctions and disjunctions, thus 
writing AV BV CV D instead of ((AV B) VC) V D. 

Sentences written purely in terms of =,— are not always readily understand- 
able. Much preferable for some purposes are sentences written using 7, V,/A- 
especially those in one of the following special forms: 


DEFINITION 3.8. (a) A sentence @ is in disjunctive normal form iff it is a 
disjunction (0; V 02 V...V6,) in which each disjunct 6; is a conjugation of sentence 
symbols and negations of sentence symbols. (b) A sentence @ is in conjunctive 
normal form iff it is a conjunction (0; A 02 A... A @,) in which each conjunct 6; is 
a disjunction of sentence symbols and negations of sentence symbols. 


The advantage of having a sentence in disjunctive normal form is that it is easy 
to read off the truth assignments which statisfy it. For example 


(AAAB)V (AA BA-C)V (BAC) 


is satisfied by a truth assignment h iff either h(A) = T and h(B) = F or h(A) = 
h(B) = T and h(C) = F or h(B) = A(C) =T. 


THEOREM 3.5. Let @ be any sentence. Then there is a sentence 0* in disjunctive 
normal form and there is a sentence 6** in conjunctive normal form such that 


O0b46*,0FA0™. 


ProoFr. Let Ai,...,A, be sentence symbols. For any X C {1,...,n} we define 
Ox to be (¢1 A...,A¢n) where ¢; = A; if i € x and ¢; = 7A; if i ¢ X. It is then 
clear that a truth assignment h satisfies 0x iff h(A;) = T for i € X and h(A;) = F 
for i ¢ X. Now, given a sentence 6 built up using no sentence symbols other than 
Aj,...,An let 6* be the disjunction of all 6x such that (6 A 6x) is satisfiable— 
equivalently, such that — (0x — 0). Then 6* is, by construction, in disjunctive 
normal form and is easily seen to be equivalent to 6. If (@ A @x) is not satisfiable 
for any X then 6@ is not satisfiable, hence @ is equivalent to (A; A =A) which is in 
disjunctive normal form. 

We leave the problem of finding 6** to the reader. 
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Note that using #x’s, without being given any @ to begin with, we can form 
sentences 0* with any given truth table in A,,...,A,. Thus there are no “new” 
connectives we could add to extend the expressive power of our system of sentential 
logic. 


4. Compactness 


If ¥ is a finite set of sentences then the method of truth tables gives an effective, 
finite procedure for deciding whether or not » is satisfiable. Similarly one can decide 
whether or not = — @ for finite %& C Sn. The situation is much different for infinite 
sets of sentences. The Compactness Theorem does, however, reduce these questions 
to the corresponding questions for finite sets. The Compactness Theorem in first 
order logic will be one of our most important and useful results, and its proof in 
that setting will have some similarities to the arguments in this section. 


THEOREM 4.1. (Compactness) Let & C Sn. (a) % is satisfiable iff every finite 
Xo C & is satisfiable. (b) For 0 € Sn, 4 — 0 aff there is some finite Np C XU such 
that Xo E @. 


Part (b) follows from part (a) using part (b) of Lemma 1.4.1. The implication 
from left to right in (a) is clear, so what needs to be shown is that © is satisfiable 
provided every finite No C » is satisfiable. The problem, of course, is that different 
finite subsets may be satisfied by different truth assignments and that, a priori, 
there is no reason to assume that a single truth assignment will satisfy every finite 
subset of © (equivalently, all of X). 

Rather than taking the most direct path to this result, we will discuss in more 
generality correspondences between interpretatins and the sets of sentences they 
satisfy. In particular we look at the ways in which we could use a set © of sentences 
to define a truth assignment h which satisfies it. 

Given %, if we wish to define a particular truth assignment h which satisfies U 
we must, for example, either set h(So) = T or h(.So) = F. If So € & then we must 
make the first choice; if —S9 € & we must make the second choice. The only case 
in which we may be in doubt is if neither Sg nor —S 9 belongs in %. But even here 
we may be forced into one or the other choice, for example, if (Sp A —S3) € } or 
(ASp A S3) € B. 

Our definition of a complete set of sentences is intended to characterize those 
for which we have no choice in defining a satisfying truth assignment and for which 
we are not forced into contradictory choices. 


DEFINITION 4.1. A set [ C Sn is complete iff the following hold for all ¢,~ € 


) (74) € Tiff g ¢T, 
) (A) ET if éeTandwel, 
(iii) (6V~) ET iféeToryvel, 
) (67 w) ET iff (-¢) €T or pp eT, 
) (é@ wv) €T iff either both ¢,W €T or both ¢,% ¢T. 


DEFINITION 4.2. Given a truth assignment h, T(h) = {0 € Sn| h Eo}. 


Complete sets of sentences are exactly what we are after, as shown by the next 
result. 
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THEOREM 4.2. A set I of sentences is complete iff l = T(h) for some truth 
assignment h. 


PRooF. From right to left is clear because the clauses in the definition of 
complete sets mimic the recursion clauses in extending h to h. 

Conversely, if I is complete we define h by h(S,,) = T iff S,, € T and show by 
induction that a sentence @ belongs to I iff h(0) = T. 


Since clearly two truth assignments h;,h2 are equal iff T(h1) = T(h2) we 
have a one-to-one correspondence between truth assignments and complete sets of 
sentences. 

The relevance of this to proving the satisfiability of sets of sentences is the 
following consequence. 


COROLLARY 4.3. Let UC Sn. Then d is satisfiable iff there is some complete 
set I of sentences such that u CT. 


Thus our approach to showing that some set of sentences is satisfiable will be 
to extend it to a complete set. For the specific purposes of showing compactness 
we will need the following terminology. 


DEFINITION 4.3. A set % C Sn is finitely satisfiable iff every finite No C b is 
satisfiable. 


Thus our method in proving compactness will be to show that a finitely satis- 
fiable set % of sentences can be extended to a complete set I. We will construct 
this extension step-by-step, using the following lemma at each step. 


LEMMA 4.4. Assume » is finitely satisfiable and let 0 be a sentence. Then at 
least one of UU {0}, UU {0} is fnitely satisfiable. 


At the end of the construction the verification that the resulting set [ is com- 
plete will use the following two lemmas. 


LEMMA 4.5. Assume that %,, is finitely satisfiable and Uy C Un4y for alln € w. 
LetT =U,<, Un. ThenT is finitely satisfiable. 


new 


LEMMA 4.6. Assume that T is finitely satisfiable and for all sentences ¢ either 
@€T or (nd) €T. ThenT ts complete. 


We leave the proofs of these lemmas to the reader and proceed to give the 
construction. 

First of all, since our formal system S has only countably many symbols and 
every sentence is a finite sequence of symbols, it follows that S’n is a countable set, 
so we may list it as Sn = {bn| n € wh. 

Next we define, by recursion on n € w a chain {Un}new of finitely satisfiable 
sets of sentences as follows: 

Xo =U 
Xn U {dn}, if this is finitely satisfiable 
Mnt1 = { En U {adn}, otherwise 
n nS> 
The first lemma above establishes that in either case 41 will be finitely satisfiable. 


Finally, we let l = U,,<,, Un- Tis finitely satisfiable by the second lemma above. 
If ¢ € Sn then there is some n € w such that ¢ = ¢,. Thus either @ € U,4, CT 
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or (3d) € Hn41 CT by the construction. Thus we conclude that I is complete by 
the last lemma above. 

To return to the question with which we opened this section, how does the 
Compactness Theorem help us decide whether or not } | 0? Assume that we are 
given some explicit listing of © = {o,,|n € w}. Then} — Oiff , = {o0,..-,on} F 
6 for some n € w. Thus we check each n in turn to see if U, — @. If in fact © - @ 
then we will eventually find an n € w such that %, | @, and hence be able to 
conclude that © — 6. Unfortunately, if & A 6 this process never terminates and so 
we are unable to conclude that 5 0. 


5. Formal Deductions 


To complete the model of mathematical reasoning sketched in the Introduction 
we need to introduce the concept of a formal deduction. This does not play an 
important role in sentential logic because the method of truth tables enable us to 
determine which sentences are valid, so we only sketch the development in this 
section. 

We will specify a set Ao of validities to serve as logical axioms and a rule for 
deriving a sentence given certain others—both of these will be defined syntactically, 
that is purely in terms of the forms of the sentences involed. 

The rule, called modus ponens (MP), states that w can be derived from ¢ and 
(¢ + w). Note that application of this rule preserves validity, and more generally, 
iff E d¢dandIrE (¢> yw) then TE v. 

To minimize the set Ao we restrict attention to sentences built using only the 
connectives =,—. This entails no loss since every sentence of sentential logic is 
logically equivalent to such a sentence. 


DEFINITION 5.1. The set Ag of axioms of sentential logic consists of all sentences 
of the following forms: 
(a) (¢> (> ¢)) 
(b) (67 (> x)) += (> 4) = (> x) 
(c) (np > 79) > (Cod > 9) > 4) 


DEFINITION 5.2. Let f C Sn. A deduction form T is a finite sequence ¢o,..., On 
of sentences such that for each 7 < n one of the following holds: 
(i) ¢: € Ao UT 
(ii) there are j,k <i such that ¢, = (¢; > ¢i). 


We say ¢ is deducible from T, written + @, iff there is a deduction ¢9,..., bn 
from [ with ¢ = gp. 
The following is easily verified. 


LEMMA 5.1. (Soundness) If[ F @ thenT — ¢. 


To prove the completeness of the system we assume that [ / ¢ and show that 
TU {>¢} C I* for some complete set I'*, and thus I U {=¢} is satisfiable and so 
T -7¢. 

To explain what is going on in this argument we introduce the syntactical 
concept corresponding to satisfiability. 


DEFINITION 5.3. Let & C Sn. % is consistent iff there is no sentence ¢ such 
that % + @ and NF 7¢@. 
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Soundness easily implies that a satisfiable set % is consistent. The converse 
is proved by showing that if is consistent then 4 C T for some complete set I. 
This is similar to the argument in the preceding section for compactness—the lemma 
needed is as follows: 


LEMMA 5.2. Assume % is consisten and let @ be any sentence. Then at least 
one of SU {0}, UU {70} is consistent. 


To see that this yields completeness, we need to show that [U{-¢} is consistent 
provided [' / ¢. This uses the follwoing fact (the Deduction Theorem—also used in 
the preceding lemma): 


PROPOSITION 5.3. For any T,¢,wW the follwoing are equivalent: 
Pr (gp), TUL} Fy. 


We will look more closely at deductions in the context of predicate logic. 


6. Exercises 


DEFINITION 6.1. A set © of sentences is independent iff there is no sentence 
a €X such that (© \ {o}) Eo. 


DEFINITION 6.2. Sets ©; and “2 of sentences are equivalent iff %; -/ Ne and 
Me FX. 


(1) Let © = {(S, V S41) :n € w}. Prove or disprove: © is independent. 

(2) Let © = {(Spn41 3 Sp): € wh}. Decide whether or not & is independent. 

(3) Prove or disprove (with a counterexample) each of the following, where the 
sentences belong to sentential logic: 

(a) ify EO and w E @ then (pV w) FO; 
(b) if (pA) F@ then y FA orp E 8. 

(4) For any expression a let s(a@) be the number of occurences of sentence symbols 
in a and let c(a) be the number of occurences of binary connectives in a. Prove 
that for every o € Sn we have s(a) = c(a) + 1 

(5) Prove Lemma 1.2.3 about proper initial segments of sentences. [Hint: Why will 
a proper initial segment of a sentence not be a sentence?| 

(6) Decide, as efficiently as possible, whether or not 


{((C > B) = (A> -D),((B > C) — (D> A))$} E(B > AD). 


(7) Prove that every sentence o in which no sentence symbol occurs more than 
once is satisfiable, but that no such sentence is a tautology. 
(8) Assume © is a finite set of sentences. Prove that there is some %’ C ¥ such 
that X’ is independent and © and ©’ are equivalent. 
(9) Let © be an arbitrary set of sentences. Prove that there is some %’ such that 
&’ is independent and © and ©’ are equivalent. 
(10) Prove Lemma 1.5.3. [Since this is a lemma used to prove the Compactness 
Theorem, Theorem 1.5.1, you may not use this theorem in the proof.] 
(11) Assume that o — oy,» for all k € w. Prove that there is some n € w such that 
yo N:+:A Yn F ep for all k Ew. 
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(12) Give an example of a satisfiable sentence o and sentences y, for k € w such 
that o — vy, for all k € w but there is no n € w such that yo A---A Yn F Yr 
for all k € w. 

(13) Assume that o and yx are given so that for every assignment h we have 


h Eo iff (hk E ys for every k € w). 


Prove that there is some n € w such that yo A--:A Qn — x for all k € w. 


CHAPTER 2 


First-Order Logic 


0. Introduction 


In mathematics we investigate the properties of mathematical structures. A 
mathematical structure consists of some set A of objects (the domain, or universe, 
of the structure) together with some functions and/or relations on the domain— 
both must be specified to completely determine the structure. Thus the set Z of all 
integers can be the domain of many different structures on Z in which the functions 
+ and - are given; the ring structure in which also multiplication is considered; the 
(pure) order structure in which the relation < is given, but no functions; the ordered 
group structure in which <,+, and — are included; etc. 

In all these possible structures one considers not just the functions and relations 
acutally listed, but also the functions and relations which are generated or defined 
from them in certain ways. In practice, the allowable ways of generating more 
functions and relations may be left vague, but in our formal systems we need 
to be precise on this point. Certainly, in all cases we would be allowed to form 
compositions of given functions obtaining, for example, polynomials like x-~—y+a-z 
in the ring structure of Z. Normally constant functions would also be allowed, tus 
obtaining all polynomials with integer coefficients in this example. 

Similarly one can compose relations with functions obtaining, for example, re- 
lations like (e + #) < y-z in the ordered ring structure. Equality would also 
normally be used regardless of whether it was explicitly listed. Connectives like 
a, A, vee would enable us to form further relations. For example from binary rela- 
tions R(x, y), S(x,y) on A we define relations “R(x, y), the relation which holds if 
R fails; R(x, y) A S(x,y), the relation which holds iff both R and S hold; etc. 

In the ring structure on Z we would have, for example, the binary relation 
R(«,y) which holds iff ¢ = y-y. Thus R(1,1), R(4,2) would hold, R(2,1) would 
fail, etc. We would certainly also consider the new relation P(x) which holds iff 
R(a,y) holds for some y in the domain—P(z) iff « = y- y for some y € Z in this 
example. And from =R(x,y) we can define Q(x) which holds iff “R(x, y) holds for 
all y in the domain—Q(z) iff ay - y for all y € Z in this example. 

Finally the statements made about a structure would be statements concerning 
the relations considered—for example, the statements that P(a) holds for some x 
in the domain (true in this example) or that P(x) holds for every x in the domain 
(flase in this example but true if the domain is enlarged from Z to the complex 
numbers). Normally we would also be allowed to refer to specific elements of the 
domain and make, for example, the statements that P(4) holds or Q(3) holds—both 
true in this example. 

Our formal systems of first order logic are designed to mirror this process. 
Thus the symbols of a first order language will include symbols for functions, for 
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relations, and for fixed elements (“constants”) of a domain. Among the expressions 
we will pick out some which will define functions on a domain—and these functions 
will include the listed functions and be closed under composition. Similarly other 
expressions will define relations on a domain-and these relations will be closed 
under the operations outlined above. Finally, the sentences of the language will 
make assertions as indicated above about the definable relations. 

Some important points to notice: first of all, there will be many different lan- 
guages according to the selection of (symbols for) functions, relations, and con- 
stants made. Secondly, a given language may be interpreted on any domain, with 
any choice of functions, relations and elements consistent with the symbols—thus 
we will never have a language which must be interpreted on the domain Z or with 
a symbol which must be interpreted as +, for example. 


1. Formulas of First Order Logic 


We follow the outline in the previous section in defining the symbols of a first 
order language, the terms (which correspond to the functions) and the formulas 
(which correspond to the relations). In constructing formulas we use the symbols 
Vv and 4 for the quantifiers “for every” and “there is some” and we use = for the 
special relation of equality or identity which is in every first order language. 


DEFINITION 1.1. The symbols of a first order language Ccomprise the following: 
1) for each m > 0 some set (perhaps empty) of m-ary function 
symbols; 
2) some set (perhaps empty) of individual constant symbols; 
3) for each m > 0 some set (perhaps empty) of m-ary relation 


symbols; 
3a) the binary relation symbol for equality: =; 
4) a (countably infinite) list of individual variables: vp,...,Un,..- 


for all n € w; 
5) the sentential connectives: 7, A, V,—; 
6) the quantifiers: V, 5; 
7) parentheses: (, ). 


We will use (perhaps with sub- or superscripts) letters like F’,G for function 
symbols, c,d for constant symbols and R,S for relation symbols. Anticipating the 
formal definition of £-structure in the next section, an interpretation of £consists 
of a non-empty set A (the domain or universe of the interpretation) and for each 
m-ary function symbol F' an m-ary function F* on A, for each constant symbol c 
an element c* of A, and for each m-ary relation symbol R an m-ary relation R* 
on A-however = is always interpreted as actual equality on A. The variables will 
range over elements of A and quantification is over A. 

The symbols listed in 3a)-7) are the same for all first order languages and will be 
called the logical symbols of £. The symbols listed in 1)-3) will vary from language 
to language and are called the non-logical symbols of £. We will write £™ for the 
set of non-logical symbols of £. In specifying a language Lit suffices to specify £L™. 
Note that the smallest language will have £L’‘= 0). Note also that to determine 
Lone cannot just specify the set L’but must also specify what type of symbol each 
is, such as a binary function symbol. 

The terms of £will be those expressions of Cwhich will define functions in any 
interpretation. These functions are built from the (interpretations of the) function 
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symbols by composition. In addition we can use any constant symbol of Lin defining 
these functions, and we consider a variable v,, standing alone as defining the identity 
function. We also allow the “limiting case” of a function of zero arguments as a 
function. We thus have the following definition. 


DEFINITION 1.2. For any first order language £the set Tm, of terms of Lis 
defined as follows: (1) uv, € Tmg for every n € w, c € Tmg for every constant 
symbol of ¢ of £, (2) if F is an m-ary function symbol of Land t,...,tm € Tme 
then Ft, ...tm € Tme. 


This is, of course, a definition by recursion with the last clause “noting else 


is a term” understood. The reader may be surprised that we have not written 
F(t,,...,tm) but this is not required for unique readability (although it would 
certainly help practical readability at times). 

Just as with sentences of sentential logic we have a theorem justifying proof by 
induction on terms, whose proof we leave to the reader. 


THEOREM 1.1. Let X C Tmg and assume that (a) v, € X for alln€w,cEX 
for every constant symbol c of £L, and (b) whenever F' is an m-ary function symbol 
of Land ty,...,tm € X then Ft,...tm € X. Then X = Tmg. 


Even without parentheses every term is uniquely readable, as we leave to the 
reader to establish. 


THEOREM 1.2. For each t € Tme with lh(t) > 1 there is exactly one choice 
of m > 0, m-ary function symbol F of Land ty,...,tm € Tmg such that t = 
Fty,..-,tm- 


And finally, with unique readability we can define functions on Tm, by recur- 
sion. We leave the formulation and proof of this to the reader. 

In defining the class of formulas of first order logic we start with the formulas 
obtained by “composing” the given relation (symbols) with terms. 


DEFINITION 1.3. The atomic formulas of Care the expressions of the form 
Rt, ...tm for m-ary relation symbols R € £ and tj,...,tm € Tme. 


The atomic formulas are the basic building blocks for formulas, just as sentence 
symbols were the building blocks for sentences in sentential logic. 


DEFINITION 1.4. For any first order language Cthe set Fm, of formulas of Lis 


defined as follows: 
if ¢ is an atomic formula of £, then ¢ € Fm,;, 


2) if ¢€ Fm,z then (7d) € Fmg, 
3) if do €Fm,z then (dxv) € Fme 
for any binary connective x, 
4) if @€ Fme¢ then Vuzd, dung € Fmg for every n € w 


Note that atomic formulas do not have length 1; in fact in some languages there 
will be arbitrarily long atomic formulas. Nevertheless induction on length yields 
the following principle of proof by induction in which the atomic formulas are the 
base case. 


THEOREM 1.3. Let X C Fmg and assume that: (a) ¢ € X for every atomic 
formula @ of L, (b) 6 € X implies (7d) € X, (c) 6, € X implies that (bx) € X 
for binary connectives x, (d) 6 € X implies Vund, dund € X for everyn €w. Then 
X= Fm. 
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As with terms, or sentences of sentential logic, both unique readability and a 
principle of definition by recursion hold for Fm;. We leave both the formulation 
and proof of these to the reader. 

We give here some examples of terms and formulas in particular first order 
languages. 

(1) £“= 0. Here Tme = {v,| n € w}. Since =, being a logical symbol, belongs 
to every first order language, the atomic formulas consist of the expressions = v,Vx, 
for n,k € w. Specific formulas then include (= = v9v1), Sui(7 = vov1), ((= voviV = 
vov2) V (= v1v2)), Vuodur(7 = vov1), VuoVurVv2((= vov1V = vov2) V (= v102)). 

An interpretation for this language will be determined by some A # 9 as its 
domain. We will always interpret = as equality (“identity”) on the domain. It is 
thus clear, for example, that the formula (= = v9v) will define the relation R* (x, y) 
on A such that R*(a,a’) holds iff a 4 a’. Similarly the formula 4v,(7 = vpv1) will 
define the unary relation P*(x) on A such that P*(a) holds iff there is some a’ € A 
such that R*(a,a’) holds, i.e. a 4 a’. Note that P*(a) will hold of no elements a 
of A. 

(2) £"'= {R, F,c} where R is a binary relation symbol, Fa unary function sym- 
bol and cis a constant symbol. Now the terms of £also include c, Fu,, Fc, PF Fun, FF, 
etc. The atomic formulas consist of all expressions = tit2 and Rt,t2 for ty,t2 € 
Tm,-for example = cF'v1, RuoF vo, Revi. Further formulas will include (= = 
cFv1), Rvugv: 3 RF vp F v1), du, = vo Fv, Voi Revi. 

One familiar interpretation for this language will have domain A = w, interpret 
Ras <, F as immediate successor, and c as 0. That is R*(k,1) holds iff k < , 
F*(k) =k+1, c% =0. The term F'Fv, will ben define the function (F'Fu,)* 
defined as (F'F'v,)*(k) = F*(F*(k)) = k +2 for all k € w. The term FF will 
define the particular element F*(F*(0)) = 2 of w The formula Jv; = vo Fv, will 
define the unary relation on w which holds of k iff k = F*(1) for some | € w, that 
is, iff k =1+ 1 for some ! € w, thus iff k £ 0. 

Giving a precise definition of how terms and formulas are interpreted in com- 
plete generality is far from easy. One problem is that the relation defined, for 
example, by the formula (¢ V 7) is not just determined by the relations defined by 
@ and by w separately, but also depends on the variables used in ¢@ and in w and 
on how they are related. Thus, we have pointed out that for any choice of distinct 
variables v,,vz, the formula (= = v,v,) will define the binary relation R*(x,y) 
such that R*(a,a’) holds iff a 4 a’. But the formula ((— = vpvg) V (7 = Vmvr)) 
could define either a binary or ternary or 4-ary relation depending on the variables. 
The situation is even more complicated in our second example with the formulas 
(Rvov1V Rv, v2), (Rupv1 V Rv2v1), (Rvove V Rviv2) etc. all defining different ternary 
relations. 

Our solution here is to realize that the interpretation of a term or formula de- 
pends not only on the term or formula itself but is also dependant on the choice 
of a particular list of variables in a specific order. Thus in addition to beig inter- 
preted as the binary relation R* on A, the formulas Ruvpv; and Rv,v2 can each 
be interpreted as ternary relations relative to the list v9, v1, v2 of variables. Ruov1 
would then be the relation 55 such that 53(a,a’, a’) holds iff R*(a,a’) holds, and 
Rv v2 would then be the relation Sf such that S{(a,a’,a”) holds iff R*(a’,a’) 
holds. We can then say that (Ruov1 V Rv; v2) is interpreted by the ternary relation 
Sh (x,y, 2) V ST (a, y, 2). 
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What variables must occur in a list so that a term t or a formula ¢ will define 
a function or relation relative to that list? Clearly for terms this would be just the 
list of all variables occurring in the term. The answer for formulas is less obvious. 
We have pointed out, for example, that the formula dv; = vp Fv, defines a unary 
relation on A, despite having two variables. The reason, of course, is that the 
variable v; is quantified and so the formula should express a property of vo alone. 
Unfortunately the same variable can be both quantified and not quantified in the 
same formula, as shown (for example) by 


(Rv 1 v9 TT Av} = voF vy). 


This formula must be interpreted by (at least) a binary relation, since the first 
occurrence of v; is not bound by any quantifier. 

We are thus lead to the following definition of the variables which occur free in 
a formula, and which must therefore be among the variables listed when considering 
the relation. The formula defines in an interpretation. 


DEFINITION 1.5. For any ¢ € Fmg the set F'u(¢) of variables occurring free in 
@ is defined as follows: 
1) if d is atomic then F'v(¢) is the set of all variables occuring in 4; 
2) Fu(n¢) = Fv(¢); 
3) Fo(o« pb) = Fo(¢) U Foy); 
4) Fu(An¢) = Fu(Wun¢) = Fv(¢) — {un}; 
Thus in any interpretation a formula @¢ will define a relation in the list of its 


free variables. If ¢@ has no free variables then it will simply be either true or false 
in any interpretation, which justifies the following definition. 


DEFINITION 1.6. The set Sn¢ of sentences of Lis {¢ € Fme| Fu(d) = OF. 


We need to have a notation which will exhibit explicitly the list of variables 
considered in interpreting a term or formula. 


DEFINITION 1.7. 1) For any t € Tmg we write t = t(#1,...,%,) provided 
{x1,...,%n} contains all variables occurring in t. 2) For any ¢ € Fm¢ we write 
@ = $(#1,...,%) provided Fu(d) C {a1,..., ay}. 


We emphasize that the term or formula in question does not determine the 
list of variables nor the order in which they occur. Thus, if @ is dv; = vo F v1 
then we could have any of the following: ¢ = (vo), 6 = ¢(vo, v3), 6 = (v3, Vo), 
@ = G(vo, V1, v2), etc. The list of variables will determine the arity of the function 
or relation defined in any interpretation, and the order in which the arguments are 
taken from the variables. 

Consider $(vp) = Ju, = vo Fv. In any interpretation ¢(vp) will define the set 
(i.e. unary relation) consisting of all a € A for which a = F*(a’) for some a! € A. 
Let o = dv, = cFv,. Then o is a sentence and o will be true in an interpretation 
iff c* belongs to the set (¢(v9))* defined by ¢(vg). It is natural to express this by 
saying “c satisfies ¢” and to write o as ¢(c). Our definition of substitution will 
justify this usage. 


DEFINITION 1.8. a) Let ¢ € Tmg, 2 a variable and s € Tme. Then ?% is the 
term formed by replacing all occurrences of x in t by s. b) Let ¢ € Fmg, xa 
variable and t € Tm;. Then ¢/ is the result of replacing all free occurrences of x 
in @ by the term t-formally: 
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Or 


1) @ is ¢ with all occurrences of x replaced by t if ¢ is atomic; 
2) (a@)F = (047); 
3) (dx)? = (b? x vf) for x a binary connective; 
4) (Sun @)? = June if © = vy or Aun (7) if e F vp; 
) 


Similarly for (Vun¢)?. 


In particular, if t = t(x) we write t(s) for t?, and if d = ¢(x) we will write #(t) 
for $?. 

More generally we can define t71777°" and @;1"" as the results of simulta- 
neously substituting t1,...,¢, for all (free) occurrences of 71,...,@, in t, @ respec- 
tively. Note that we may have 


(Oi is Fotis F (Oe i - 
If ¢ = t(a1,...,@) and d = ¢(a1,...,2,) then we will write t(t1,...,tn) for 
tn? and O(t1,...,¢n) for dey". 


2. Structures for First Order Logic 


First order languages are interpreted on (mathematical) structures, among 
which we will find the usual structures studied in mathematics. The abstract defi- 
nition, which is very close to the informal definition from the preceding section, is 
as follows. 


DEFINITION 2.1. A structure for a first order language Cis a pair 2 = (A,Z) 
where A is some non-empty set (called the universe or domain of the structure) and 
T is a function whose domain is £™ satisfying the following: (1) if F is an m-ary 
function symbol of £then Z(F) = F* is an m-ary function defined on all of A and 
having values in A, (2) if ¢ is a constant symbol of Lthen Z(c) = c* € A, (3) if R 
is an m-ary relation symbol of Cthen Z(R) = R™ is an m-ary relation on A. 


Note that = is not in the domain of Z since it is a logical symbol, so it does 
not make sense to refer to Z(=) or =*. We also point out that the functions 
interpreting the function symbols are total-thus a binary function symbol cannot 
be interpreted, for example, as unary on w. 

We customarily use German script letters 21,%8,... to refer to structures, per- 
haps with sub- or superscripts. By convention the universe of a structure is denoted 
by the corresponding capital Latin letter, with the same sub- or superscript. 

In practice we suppress reference to Z and just give its values. Thus if £’= 
{R, F,c} where R is a binary relation symbol, F is a unary function symbol, and c is 
a constant symbol, we might specify a structure for Cas follows: 2 is the structure 
whose universe is w such that R*(k,1) holds iff k < 1, F™(k) = k +1 for all k 
and c* = 0. When the specific symbols involved are clear, we may just write the 
sequence of values of Z in place of Z. Thus the preceding example could be written 
as 2 = (w,<,s,0) where s : w — w is the (immediate) successor function. 

A structure is a structure for exactly one language £. If £; and £2 are different 
languages then no £y-structure can also be an £L2-structure. Thus if Lt! = {R, F,c} 
as above and £3! = {S,G,d} where S is a binary relation symbol, G is a unary 
function symbol and d is a constant symbol, then one £,-structure is 2 given above. 
An £o-structure could be 8 with universe w, with S interpreted as <, G as the 
successor function and d® = 0. Informally we could express 8 as B = (w, <, s,0)— 
but 2 and % are totally different structures since the symbols interpreted by <, s, 
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and 0 are different. If £3 = £1 U Le, so ee = £YuU ce’, then one L3-structure 
would be 2* with universe w, both R and S interpreted as <, both F and G as s 
and c™" = d™" = 0. It would be possible, but confusing, to write 


Qa" ane (w, <, 5, 0, — 5, 0). 


There is, however, one very important relation between structures in different 
languages, in which one structure is a reduct of the other to a smaller language. In 
this case the structures are equivalent as far as the smaller language is concerned 
and can be used interchangeably. 


DEFINITION 2.2. Let £1 and £2 be first order languages with £1 C Le (equiv- 
alently £3! C £3!). Let 2 be an Li-structure, B an Lo-structure. Then 2 is the 
reduct of 8 to £,, and % is an expansion of 2 to Lo, iff 2l and B have the same uni- 
verse and they interpret all symbols in £%/ precisely the same. We write &% = BL, 
if 2 is the reduct of 2 to Ly. 


Thus in the above examples of £1-, £2- and £3-structures, 2 = 2* | L; and 
B= A* Lo. Note that in spite of the terminology “expansion” that the universe 
of a structure remains fixed when passing to an expansion-only the language is 
expanded. 

One of the most important special cases of an expansion of a structure occurs 
when we add (new) constant symbols so as to name some elements of the structure. 


DEFINITION 2.3. Let Cbe a first order language, let 21 be an £-structure and 
let X C A. 

(a) L(X) = LU {cq| a € X} is the language obtained from Lby adding a new 
constant symbol c, for each a € X. 

(b) 2x is the expansion of 2 to an £(X)-structure such that 


ctx =a for allac X. 


In particular, 214 is the expansion of 2 obtained by adding constants for every 
element of A. In ordinary mathematical practice the structures 2 and 21 would 
not be distinguished-in talking about 2 you would naturally want to talk about 
arbitrary elements of A, which means having constants for them in your language 
when you formalize. 

We will also take the point of view that in talking about 2l you will frequently 
wish to refer to specific elements of 21, but we will always carefully distinguish 214 
from 2. 

We also emphasize that there is no way that we could—-or would want to if we 
could-ensure at the outset that Ccontained constants to name every element of 
every £-structure. Since there are £-structures 2% with |A| > |L£| the first point 
is clear. For the second, recall the language Cabove with L™= {R, F,c} and the 
L£-structure WI = {w,<,s,0}. Another £-structure one would naturally wish to 
consider would be 8 = (Z, <,s,0). But if Zhad constants to refer to every element 
of Z then those constants naming negative integers could not be interpreted in 2, 
i.e. as elements of w, in any natural way. 

To recapitulate, a language £determines the class of £-structures, whose uni- 
verses are arbitary (in particular arbitrarily large) non-empty sets. In studying 
any particular £-structure 2, we will customarily pass to the language £(A) and 
the expansion 24). but in comparing two different C-structures 21,8 we must 
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use properties expressible in £Lsince L(A) will not normally have any “natural” 
interpretation on % nor will £(b) normally have any “natural” interpretation on 2. 

We now proceed to give a very intuitively natural definition of the truth of 
a sentence of Lon 2,4. Since every sentence of Lis also a sentence of £(A) this 
definition will, in particular, determine when a sentence of Lis true on 2,4. And 
since 2 and 2,4 are identical as far as Lis concerned, we will take this as the 
definition of the truth of a sentence of £on the given L-structure 2. 

An atomic formula Rt, ...tm (or = tit2) is a sentence iff the terms ¢1,...,tm 
contain no variables. We will want to say that Rt,...tm is true on 2, iff the 
relation R™ (equivalently R™4) holds of the elements of A which are named by 
the terms t),...,tm. If £has function symbols we need to first give a definition 
by recursion stating how terms without variables (also called closed terms) are 
evaluated. 


DEFINITION 2.4. Given an £-structure A% we define the interpretation t”4 of 
closed terms t of £(A) in 4 as follows: 
(1) if t is a constant symbol c of £(A) then t¥4 = c™4; 
(2) ift = Fty...t» for closed terms t1,...,tm of £(A) then 
$e PAE 5 3d), 


DEFINITION 2.5. Given an L-structure 2 we define the truth value 674 of 
sentences 6 of L(A) in 24 so that 6%4 € {T, F} as follows: 


1) if @ is Rt,...tm for closed terms ty,...,tm and R € L”™then 
g%4 = T iff R%™(174,...,¢24) holds; 

2) if 0 is = tyte for closed terms t1, tz then 074 = T iff in? = its 

3) if 0 =-¢ then 674 = T iff 674 = F; 

4) if8@=(dAy) then 674 = T iff p74 = p44 =T; 

5) if6=(¢V vw) then 64 = F iff #4 = y"4 = F; 

6) if6=(¢—> 7) then 6*4 = F iff 64 =T and y™4 = F; 

7) if 0 =Vun¢d then ¢ = d(vp) and 074 = T iff ¢(c,)"4 = T for all 
ac A; 

8) if 0 = Jvnd then ¢ = O(vn) and 6%4 = T iff 6(ca)"4 = T for some 
acA; 


NOTATION 1. Let 2 be an £-structure. (a) If 8 € Sng: then 2%4 — 4, read 0 
is true on As or XA satisfies 0, iff P4 = T. (b) If 0 € Sng then AE 4, read @ is 
true on L4 or Wy satisfies 6 or Wis a model of O, iff Aa EO. 


The above definition is designed to capture the “common sense” idea that, say 
4dx¢(x) is true on a structure iff ¢ holds of some element of the structure. We pass 
to the expanded language precisely so as to be able to express this “common sense” 
definition using sentences of a formal language. 

We extend our notations t™4, 64 to arbitrary terms and formulas of £(A) as 
follows. 


DEFINITION 2.6. Let t(a1,.-.,%n) € Tmgca). Then t*4 is the function on A 
defined as follows: for any a1,...,an, € A, t™4(a1,...,@n) =t(Ca,,--+;Ca,) 4. If t 
is actually a term of Cwe write t™ for the function t™4. 

DEFINITION 2.7. Let $(71,...,2n) € Fmgca). Then #4 is the n-ary relation 


of A defined as follows: for any a1,...,@, € A, ¢™4(a1,...,@n) holds iff %,4 
$(Ca,;+++3€a,)- If @ is actually a formula of Cwe write ¢” for the relation 674. 


l 
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Just as in the informal discussion in the preceding section, the definitions of 
the functions ¢*4 and relations ¢™4 are relative to the list of variables used, but 
this ambiguity causes no problems. 


DEFINITION 2.8. Given an £-structure 2, an £(A)-formula $(x1,...,2,) and 
elements a1,...,@n © A we say that ¢ is satisfied by aj,...,@n in A, iff Aya — 
O(Ca,;--+5Ca,)- If ¢ is in fact a formula of £we will say it is satisfied by a1,...,@n 


in 2 instead of 24. In each case we will say ¢ is satisfiable in 24 or 2 to mean it 
is satisfied by some aj,...,@n. 


Note that if 0 is a sentence of £(A) then either @ is satisfied by 24 or (6) is 
satisfied by 24, but not both. 

In extreme cases it may make sense to talk of a formula (with free variables) 
being true on a stucture. 


DEFINITION 2.9. Given an £-structure 2% and a formula ¢ = $(21,...,2n) of 
L(A) we say ¢ is true on Ay, written A, - ¢, iff Az E P(ca,,.--,Ca,) for all 
Q1,---,4n € A. If dis a formula of Lwe say ¢ is true on A and write AE ¢. 


We thus see that the following are equivalent: 21 ¢, a¢ is not satisfiable in 2, 
AE Vaz---Vand. At most one of AE ¢, 2 E 7d will hold but in general neither 
of them will hold. 

We proceed to a series of examples, using the language Cwhose non-logical 
symbols are precisely a binary relation symbol R and a unary function symbol F’. 
2 E= vx for all 2, since = is interpreted by actual equality in every L- 
structure. Hence also 2 | Vx = xz for all 2. 

If x,y are different variables then = zy is satisfiable in every 2, since 24 -= 
CaCa for all a € A; hence AE Ardy = vy for all A. Hwever = zy is true on A 
iff A contains at most (and so exactly) one element; thus also % | VaVy = xy iff 
|A| = 1. 

Similarly = = xy (for different variables, x,y) is satisfiable on 2 iff 2 
Ardy = vy iff |A| > 2. Analogously for x1, x2, x3 all different variables the formula 


l 


5 EB 41% N74 = %1%3 N37 = 2243 


is satisfiable in 2 iff |A| > 3. 

More gernerally, for each positive integer n we obtain a formula ¢,,(21,...,@n) 
without quantifiers (hence called a quantifier-free formula) which is satisfiable in 
QA iff |A| > n. If we define 6, to be the sentence 4x, ---3r,¢, then AE 6, iff 
|A| > n. We then have 2% | (0, A 7@n41) iff |A| = n. Given integers k,l,n with 
k > 2,k <1 <n we could also, for example, write down a sentence o such that 
AE o iff either |A] < k or |A] = 1 or |A| > n. Note that these formulas and 
sentences use no non-logical symbols and thus will belong to every language. 

We now consider two particular £-structures: 2% = (w, <,s) and 8 = (Z,<,s). 

If do(x) is JyRry then ¢} = w, ey = Z, hence both structures are models of 
the sentence VadyRay. 


If di(x) is VyRxy then 67 = {0} and ¢P = 0, hence A —/ ArVyRry by 
BE a eaee 

If ¢o(x) is Sy = xFy then 63 =w — {0} but ¢? = Z. Thus 8k Vary = aFy 
but 2 = “vray = = «Fy, that A - drVyn= ea 

We noted above that $1(«) is such that = {0}. If we now define ¢3(y) to 


be da(¢1(x)A = yFx) then gf = {1}. In ite same way we can find, for every 
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k €w, a formula Wy (y) such that v2 = {k}. Are there formulas x, for k € Z such 
that x7 = {k}? Note that it would suffice to show that there is a formula yo with 
xo = {0}. 

We conclude this section with three important facts about the truth or satisfi- 
ability of substitutions. 

First, suppose Lis a language containing (among other things) an individual 
constant symbol d. Let 2% be an L-structure and let aj = d*. Then in L(A) we 
will have also the constant symbol cq, and in 24 both d and c,, will be interpreted 
as the element ao. If ¢(a) is a formula of £(A) then, by definition, we will have 
A, -E Vad(x) iff M4 — O(c.) for alla € A. A priori we could have %4 | Vrd(x) 
even though 24 - 7¢(d), although this would clearly be undesirable. Luckily we 
can prove that this counter-intuitive state of affairs never occurs. 


THEOREM 2.1. Let & be an L-structure, let t be a closed term of L(A), let 
ag = t™, and let o(x) be any formula of L(A). Then 


Wa F Olt) uf Xa F P(Cao): 


In particular 
Ba E Vad(x) > A(t). 

Our second fact attempts to generalize the first to the case in which the term 
need not be closed. That is, if 21 is an £-structure, (x) is and £(A)-formula and 
t is a term of C£what can we say about the relation between ¢(x) and ¢(t)? In 
particular will we still have %4 —& (Vx@(x) > o(t))? [Note that this will normall 
not be a sentence, due to the variables in t.] 

As the simplest possibility, consider the case in which ¢ is just another variable 
y. The desired result, then, is that ¢ = ¢(x) and ¢(y) = ¢] both define the same 
subset of A in 2,4-that is, for every a € A we have A4 — $2, iff Wa & (7). 

In this even we will certainly have %4 — Vrd(x) > ¢(y). Unfortunately there 
are certainly problems depending on how y occurs in ¢. For example, let ¢(x) be 
dy7 = xy. Then ¢(y) is the sentence Jy = yy, which is always false, and hence 
whenever |A| > 2 we will have A 4 Vrd(x) > o(y). What went wrong here is 
that, in passing from ¢ to gy some of the near occurrences of y became bound-—if 
this did not happen there would be no problem. The formal definition of “no new 
occurrences of y become bound” is given in the following definition. 


DEFINITION 2.10. For any Zand any variables x,y we define the property “y 
is substitutable for x in @” for @ € Fm, as follows: 
1) if d is atomic then y is substitutible for x in ¢, 
2) if d = (Aw) then y is substitutible for x in ¢ iff y is substitutible for « in y, 
3) if ¢é = (W* x) where x is a binary connective, then y is substitutable for x 
in @ iff y is substitutible for x in both w and x, 
4) if 6 = Vunw or @ = Avpw then y is substitutible for x in ¢ iff either 
x ¢ Fv(¢) or y £ vp and y is substitutible for x in w. 


Note in particular that x is substitutible for x in any ¢, and that y is substi- 
tutible for x in any ¢ in which y does not occur. 
The following result states that this definition does weed out all problem cases. 


THEOREM 2.2. Let 2 be an £L-structure. (1) Let o(x) € Fmgca) and assume y 


is substitutible for x in bd. Then o*%4 = (pe)FA. (2) Let 6 © Fmga) and assume 
y is substitutible for x in d. Then Xa = (Vxd > Gf). 
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In an entirely analogous fashion we can define, for arbitrary terms t of £, the 
property t is substitutible for x in ¢ to mean (informally) no new occurrences in $f 
of any variable y occurring in t become bound. We leave the precise formulation of 
this to the reader. The resulting theorem is exactly what we were after. 


THEOREM 2.3. Let 2 be an L-structure, @ € Fmgiay, t € Tm). Assume t 
is substitutible for x in b. Then Xa — (Vxg - Of). 


We remark, finally, that we can extend our notion of substituition of a term for 
a variable x to a notioin of substitution of a term for a constant c. We leave to the 
reader the task of defining #f, and ¢f'";". The main properties we will require are 
summarized in the following theorem. 


THEOREM 2.4. Let 6 € Fmg and let y be a variable not occurring in @. Then 
(1) ¢ does not occur in ¢, (2) (by)% = ¢. 


3. Logical Consequence and Validity 


The definitions of logically true formulas, and of logical consequences of sets of 
sentences, now are exacgly as expected. Some care, however, is needed in defining 
logical consequences of sets of formulas. 


DEFINITION 3.1. Let ¢ be a formula of £. (1) ¢ is logically true or valid, 
written E 4, iff & — ¢ for every £-structure A. (2) ¢ is satisfiable iff ¢ is satisfiable 
on some £-structure 2. 


The basic connection between satisfiability and validity is just as in sentential 
logic. In addition the validity and satisfiability of formulas can be reduced to that 
of sentences. 


LEMMA 3.1. Let 6 = (21,...,%n) € Fmg. 
(1) - ¢ iff n¢ is not satisfiable 
(2) Fé if-Vr1---Vind 
(3) o ts satisfiable iff dv,--- Arn is satisfiable 


Since there are infinitely many different £-structures for any language Cone has 
no hope of checking them all to determine, for example, if some given formula is 
valid. Nevertheless, one can frequently figure this out, as a few examples will make 
clear. 


EXAMPLE 3.1. Let Lbe a language with unary relation symbols P and Q. 
Determine whether or not ¢ is valid where 


a =Va(Px > Qz) > (VaPx > VeQz). 
Suppose 2 4 o, hence 2% / —« since o is a sentence. Then A - Va(Px > Qa), 
AE VaPx but AF VeQx. The last assertion means that 24 / 7Qca, for some 


ao € A. But the other two assertions imply that 24 F Pca, and 24 - (Pca, > 
Qca,); which contradict 24 FE 7Qca,. Thus we conclude a is valid. 


EXAMPLE 3.2. Determine whether or not @ is valid where 
6 = (VaPx > VrQxz) — Va(Px > Qz). 
Suppose 2 JF 6, hence 2 — 70.Then 2 — (VaPx > VrzQz) but A Vr(Px > Qa). 


The last assertion means that %4 - Pc,, and %4 - 7Qc,, for some ag € A. 
The first assertion breaks into two cases. In case 1, 2% KE VaPx and in case 2, 
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2 VaeQax. Case 2 is contradicted by the other information, but case 1 will hold 
provided 24 F —Pc,, for some a, € A. We thus conclude that @ is not valid since 
we will have 2 / 0 whenever there are elements ag,a1,€ A such that ag € P™, 
ago ¢ Q™, a, ¢ P™. For example, we can define % by specifying that A = {0,1}, 
PY = {ao}, Q* = 9. 


We can generalize the result established in Example 2.4.1 as follows. 


EXAMPLE 3.3. For any formulas ¢, w of any L, 
E Va(o > w) > (Vad > Vaw). 
Choose variables y1,..., Ym such that @ = O(a, y1,.--,Yn) and w = U(x, y1,---,Yn)- 
Suppose 2 is an £-structure such that % 4 Va(d > w) > (Vad > Vaw). Note 
that we cannot conclude that 2% Va(¢ > w) etc. since Vz(¢ > w) is presumably 
not a sentence. We can, however, conclude that there are aj,...,a, € A such that 
Ms |F O(Ca,,---;Ca,) [Where 6 = O(y1,..-,Yn) = (Va(d > W) > (Vad > Ve))] 
hence-since this is now a sentence— 

Wa E Va(h(L, Cays-- +s Can) 7 W(; Cays+++3Can)): 


The rest of the argument proceeds as before. 


Preparatory to defining logical consequence we extend some notations and ter- 
minology to sets of formulas and sentences. 


DEFINITION 3.2. If [ C Fmg then we will write [ = I'(a1,...,2,) provided 
Fu(@) C {a1,..., Un} for all @ ET. 


DEFINITION 3.3. If © C Sn¢ and A is an £-structure then we say 2 is a model 
of &, written 2 — %, iff 2 o for every o € DU. 


DEFINITION 3.4. (1) If [f C Fmg, T = I(a,...,%p) , and ay,...,@n are 
elements of an £-structure 2, then T is satisfied on 2 by a1,...,@n, written A4 = 
T'(Ca,,--+;Ca,); iff every formula in T is satisfied on 2 by ai,...,@n. (2) If T 
T(a1,..-,%n) C Fme¢ and 2 is an £-structure then we say T is satisfiable in 2 iff T 
is satisfied on 2% by some aj,..., Gn. 


I 


Note that if [ is satisfiable in 2l then every ¢ € T is satisfiable in 2 but 
the converse may fail. A trivial example is given by [T = {= xzy,7 = xy} with 
2( any structure with at least two elements. A non-trivial example is given by 
T = {¢p(x)| 1 < n € w} where ¢)(%) = AyRF yx, bo(x) = JyRFF yz, etc. in 
the language Cwhose none-logical symbols are a binary relation symbol R and 
a unary function symbol F. Consider the two L£-structures 2% = (w,<,s) and 
8 = (Z,<,s). Then 6 = Z for every n € w— {0}, hence I is satisfiable in 8. But 
ot = {k €w|n<k} for each n €w — {0}. Thus T is not satisfiable in % although 
every formula in [indeed every finite [9 C T-is satisfiable in 2. 


DEFINITION 3.5. (1) A set © of sentences is satisfiable iff it has a model. (2) A 
set T =[(a1,...,2,) of formulas is satisfiable iff [ is satisfiable in some structure 
QL. 


Note that we have only defined satisfiability for sets [ of formulas with only 
finitely many free variables total while we could extend these notions to arbitray 
sets of formulas, we will have no need for these extensions. 

We finally can define logical consequence. 
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DEFINITION 3.6. (1) Let © C Sn¢, d € Fmg. The ¢ is a logical consequence of 
= written © — ¢, iff 2 ¢ for every £-structure 2 which is a model of ©. (2) Let 
[I CFmg, ¢ € Fmg and suppose that T = T(a1,...,¢n), 6 = O(a1,...,2%n). Then 
@ is a logical consequence of [ written T — @, iff Xa — O(ca,,---,Ca, ) for every 
£-structure 2% and every a,...,@, € A such that Aa E T(ca,,.--, Ca, )- 


Part (1) of the definition is as expected. We comment on part (2) and give 
some examples. First of all, the only restriction on T is that its formulas contain 
only finitely many free variables total-since then one can certainly find a single list 
1,---,2% Of variables which includes all variables occurring free either in ¢ or in 
formulas in T. The definition is also independent of the precise list used. 

Next, the definition in part (1) is a special case of the definition in part (2). 
Thus if © C Sng and $(a1,...,%) € Fmg then also © = U(a1,...,2,). Now if 
Qt — v then in particular % / U(ca,,.--,Ca,) for all a1,...,@, € A. Thus the 
definition in part (2) yields %4 F @(Ca,,---,Ca,,) for all a1,...,a@n € A, and thus 
A - ~ as required for the definition in part (1). On the otherhand if 2 is not a 
model of © then neither definition yields any conclusion about the satisfiability of 
@ in 2. 

The definition is formulated to make the following result hold. 


LEMMA 3.2. For any =T(a1,...,@n), O(@1,---,@n), W(@1,.--,2n) we have 


PU{O Fv HEE (6 y). 


Proor. [ — (¢ > y) iff there are 2 and ay,...,@, such that %4 ET (ca,,.--, Ca, ) 


but 4 EF (b(Ca,,+-+3 Can) 4 W(Cars+++3Ca,)); that is, ay,...,@, satisfy PU {¢} in 
2 but do not satisfy , thus iff 2 and aj1,...,@, also show that TU {¢} KF w. 


Thus we see, for example, that {Ray} 4 VyRay since (Rry > VyRay) is not 
valid. On the other hand {Ray} — JyRey since (Rry > AyRzy) is valid. 

In the remainder of this section we identify some classes of validities and we es- 
tablish some further properties of the logical consequence relation. These validities 
and properties will then be used in the next section to establish a method which 
enables one to “mechanically” generate all the logical consequences of a given set 
Tr. 


To begin, tautologies of sentential logic can be used to provide a large class of 
validities of first order logic. For example (Sp > (S$; > So)) = 0 is a tautology. Of 
course it isn’t even a formula of any first order language £. But if ¢o,¢1 € Fme 
then the result of replacing So by ¢o and 5; by ¢1 throughout @ is the formula 
0* = (¢0 > (¢1 > ¢0)) of L, and | 6* for the same reasons that 6 is a tautology, 
as the reader should check. The same thing occurs regardless of what tautology 
one starts with; thus suggesting the following definition. 


DEFINITION 3.7. A formula 7 of Lis a tautology iff there is some tautology 6 
of sentential logic and some substitution of £-formulas for the sentence symbols in 
Lwhich yields the formula ~. 


Despite the “existential” nature of this definition one can in fact check any 
given formula w of Lin a finite number of steps to decide if it is a tautology. The 
point is that there will only be finitely many sentences 6 of sentential logic (except 
for the use of different sentence symbols) such that w can be obtained from @ by 
some such substitution, and each such @ can be checked to determine whether it is 
a tautology. 
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For example let o be the sentence 
(Vuo(Pup 4 Quo) > (VuoP v9 + VuoQvo)). 
Then o can be obtained only from the following sentences 6; of sentential logic: 
I9=A 
0, = (A B) 
02 = (A> (B>C)). 
Since none of these is a tautology (for distinct sentence symbols A, B,C), o is 


not a tautology either, although — oa. 
We leave the proof of the following result to the reader. 


THEOREM 3.3. [fw € Fmg is a tautology, then — w. 


The following list of facts is left to the reader to establish. 


THEOREM 3.4. (1) — (Vad — o?) whenever t is substitutible for x in ¢; 

2) K (¢ > Vad) if a ¢ Fv(¢); 

3) F (Vad — Vydy) and — (Vydy — Va@) if y does not occur in ¢; 

4) fT E @ then y — Vad provided x does not occur free in any formula in T; 


5) ff @ET thenlT — ¢; 

6) fT Eo andl CI” thenT’ — ¢; 
)ifTE¢ andTE(¢> yy) thenT EW; 

K= xx; 

9) F= xy > ($7, > $}) provided both x and y are substitutible for z in ¢. 


Logical equivalence is defined as in sentential logic. 

DEFINITION 3.8. Formulas ¢,w of Lare logically equivalent, written @ FA v, 
iff {p} Fp and {v} F ¢. 

Note, for example, that for any ¢, 
dad FA AVan7d. 


Together with equivalence from sentential logic this enables us to concludeL 


THEOREM 3.5. For any $(41,...,%n) € Fmg there is some $*(21,...,%n) € 
Fme such that ¢ FA ¢* and ¢* is built using only the connectives =, and only 
the quantifier V. 


For example, if ¢ is 


Vady(Ray V Ryx) 
then ¢* would be 
VaeWy7(>Raey > Ry). 

We have been a little lax in one matter—technically, all our definitions are 
relative to a language £. But of course a formula ¢ belongs to more than one 
language. That is: if £, £’ are first order languages and £ C L’, then Fmg C F'mg:. 
So we really have two different notions of validity here for £-formulas ¢: 

Er @ meaning 2% ¢ for all £-structures 2, 
Ec & meaning 2 — ¢ for all £’-structures 2’. 
Happily these coincide due to the following easily established fact. 


LEMMA 3.6. Assume LC L’, A’ is an L-structure, A= APL. Let d(a1,...,Ln) 
be a formula of L, a1,...,an € A. Then A’, & O(Ca,,---, Can) Uf Aa E O(Ca,;-++, Can): 
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4. Formal Deductions 


The definition of validity given in the preceding section does not yield a method 
of deciding, in a finite number of steps, whether or not a given formula is valid. In 
this section, we describe a procedure for generating the validities. A set of formulas 
each known to be valid is picked out and called the set of logical axioms. A rule is 
stated which enables us to generate more formulas in a step-by-step fashion. A finite 
sequence of formulas showing exactly how a given formula is obtained by repeated 
applications of the rule beginning with the logical axioms is called a deduction of 
the given formula. Since the rule preserves logical validity all formulas which have 
deductions are valid. In the next chapter we will prove the converse, that all valid 
formulas have deductions. 

This whole process is syntactical and capable of being automated. That is, 
whether or not a formula is a logical axiom can be determined, in a finite number 
of steps, by looking at the form of the formula, and whether or not the rule applies 
to yield a given formula from some other formulas is also determined just by looking 
at the form of the formulas in question. Thus looking at a given finite sequence 
of formulas one can determine, in a finite procedure, whether or not this sequence 
is a deduction. It follows that (for languages with just countably many formulas) 
one could program a computer to generate a listing of all deductions and thus of 
all formulas which have deductions. This does not, however, mean that we have 
a procedure which will decide, in a finite number of steps, whether or not a given 
formula has a deduction. So even with the theorem from the next chapter we will 
not have a procedure which determines, in a finite number of steps, whether or not 
a given formula is valid. 

All of this generalizes to deductions from arbitrary sets [ of formulas, and 
the theorem from the next chapter will state that ¢@ is deducible from T iff ¢ is a 
logical consequence of I’. This result will then become our main tool for studying 
properties of “logical consequence.” 

In particular, our goal in this section is not so much to develop techniques of 
showing that a specific ¢ is deducible from a specific [, but to develop properties 
of the relation of deducibility which will be of theoretical use to us later. 

Before defining the logical axioms on piece of terminology is useful. 


DEFINITION 4.1. By a generalization of a formula ¢ is meant any formula of 
the form Vx1,...,V2,¢, including ¢ itself. 


Note that ¢ is valid iff every generalization of ¢ is valid. 

We will simplify our deductive system by having it apply only to formulas built 
up using just the connectives =, and the quantifier V. This is not a real restriction 
since every formula is logically equivalent to such a formula. We will continue to 
write formulas using A, V,j but these symbols will have to be treated as defined in 
terms of =,—,V in the context of deducibility. 


DEFINITION 4.2. For any first order language Lthe set A of logical axioms of 
Lconsists of all generalizations of formulas of Lof the following forms: 
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)  tautologies, 

) (Vad > $7) where t is substitutible for x in ¢, 

) Wale > ¥) > (Vad > Va), 

) (¢-> Vad) where x €¢ Fu(¢), 

) =22, 

) (=2y > (¢% > ¢j)) for atomic formulas ¢. 

We could restrict the tautologies allowed to just those of certain specified forms 


(see Chapter One Section Six). This would be preferable to certain purposes, but 
would require more effort in this section. 


LEMMA 4.1. Let é€Fme. 1) If¢€ A then Vad € A for every variable x. 2) 
IfopedA then — @. 


Our only rule of inference is known by the Latin name “modus ponens” which 
we will abbreviate to MP. As used in a deduction it allows one to put down a 
formula 7 provided formulas ¢ and (¢ > w) precede it. 


DEFINITION 4.3. Let [ C Fme. 
(a) A deduction from T is a finite sequence ¢o,...,¢n of formulas of Lsuch that 
for every 7 < n we have either 
(1) ¢ € AUT, or 
(2) there are j,k <7 such that , = (¢; — ¢i). 
(b) The formula ¢ is deducible from T, written | @, iff there is a deduction 
o,-+-,¢n from T with ¢, = ¢. 
(c) In particular a logical deduction is just a deduction from [ = 9, and ¢ is 
logically deducible, | ¢, iff OF ¢. 


PROPOSITION 4.2. (Soundness) If[ | ¢ thenT —- ¢. In particular if @ is 
logically deducible then is valid. 


PROOF. Let ¢o,...,¢n be a deduction of ¢ from I’. We show, by induction on 
i, that [ — @; for every 1 <n. Since ¢, = ¢ this suffices to show [ = ¢. Leti<n 
and suppose, as inductive hypothesis, that [ — @,; for all 7 < i. If @& € (AUT) 
then I’ — ¢;. In the other case there are j,k < i such that , = (¢; — ¢;). By the 
inductive hypothesis T — ¢; and I’ — (¢; > ¢;), and sol — qj. 


LEMMA 4.3. Let T C Fmg. 
(1) If 6€ (AUT) thenTF ¢. 
(2) fT é andT + (6 > y) thenT Fw. 


ProoF. (of part 2) Let ¢0,...,¢n be a deduction of ¢ from T and let qo,..., Um 
be a deduction of (6 + w) from I. Then the sequence ¢o,...,¢n,W0,---, Wm, is 
a deduction of w from TL. 


Clearly any formula in (A UL) is deducible from T with a deduction of length 
one. The shortest possible deduction involving a use of MP will have length three. 
Here is an example: 

(Vag + 9), 
(Vad + 7@) > (6 4 War), 
(6 > 7Va-7¢). 
The first formula is a logical axiom since x is substitutible for x in any ¢, and 
o;, = ¢. The second formula is a tautology, and the third follows by MP. 
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This example shows that + (¢ > 7Va-7¢@) for every 6 € Fmg. Recalling our 
use of defined symbols, it may be more intelligibly expressed as F (¢ > dard). The 
reader should try to establish that | Va(¢ > dad). 

Due to our restricting the connectives and quantifiers allowed in formulas, every 
non-atomic formula has either the form 7¢ or (¢ > w) or Vad. We proceed to give 
several results which characterize the conditions under which such formulas are 
deducible from I’. These results can then be used to show deducibility of formulas. 


LEMMA 4.4. Deduction Theorem For anyT CFmgz, ¢,~ € Fmg: 
PU{o} Fr pb fT (gy). 


ProoF. The implication from right to left is an easy consequence of Lemma 
2.5.3 part 2 above-i.e. of MP. 

For the other implication, supppose w,..., Wn is a deduction from TU {¢} of 
w. We show, by induction on i, that [+ (¢ > w;) for all i <n. Since wy; = w this 
will establish T + (6 > w). So let ¢ < n and assume as inductive hypothesis that 
TH (¢ > yy) for all j <i. 

There are two cases. If w; € AUT then I + wy; hence T+ (@ > Wy) by 
MP since (W; > (¢ > ¥;)) is a tautology. If, on the other hand, 7; follows 
by MP then there are j,k < i such that vy = (Wj; — y). By the inductive 
hypothesis, ! ' (¢ > #,;) and TF (¢ > (#; > ¢;)). Use of MP and the tautology 
(d > (W; > Wi)) > (6 > 45) > (¢ > Y;)) yields the conclusion TF (¢ > y). 


The use of the Deduction Theorem is to reduce the question of finding a de- 
duction of (¢ > w) from T to that of finding a deduction of w from IU {¢}. This 
second question will usually be easier since w is shorter than (¢ > w). 

Our first reduction for universally quantified formulas is not completely satis- 
factory, but will be imporved later. 


LEMMA 4.5. (Generalization) Assume x does not occur free in any formula 
inT. ThenT 6 ofl Vad. 


PrRooF. The implication from right to left is easily established. For the other 
direction, suppose ¢9,...,@n is a deduction from TI of ¢. We show that [| Va; 
for all i <n, by induction. So, let ¢ <n and suppose as induction hypothesis that 
[+ Va@; for all 7 < i. If @ € A then also Ved; € A and thus [F Vad,;. If dj € T 
then x € Fu(¢;) hnce (¢; > Vxg;) € A and thus [+ Va; by MP. If ¢; follows by 
MP then there are j,k <i such that @, = (¢; > ¢;). By the inductive hypothesis, 
[+ Ved; and + Va(¢; > ¢;). Now (Vx(¢; > ¢:) > (Vad; > Vxd;)) € A so two 
uses of MP yield I’ + Vad, as desired. 


To remove the restriction in the statement of Generalization, we first prove a 
result about changing bound variables. 


LEMMA 4.6. Assume the variable y does not occur in @. Then (1) + (Vad > 
Vydy) and (b) F (Vydy > Vae). 


PROOF. (a) Since y is substitutible for x in ¢, Vy(Vr@ > 7) € A. Using 
an appropriate axiom of form 3) and MP we conclude F (VyVrd — Vy¢7). Since 
y ¢ Fu(Vr¢) we have (Vzd — VyVx¢d) € A and so F (Vx¢ — Vyd) using MP and 
an appropriate tautology. 
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(b) One first proves that, since y does not occur in ¢, we have that x is substi- 
tutible for y in ¢j and that (¢7)% = ¢. This result then follows from (a). Details 
are left to the reader. 


COROLLARY 4.7. (Generalization) Assume y does not occur in ¢ and y does 
not occur free in any formula inT. Then oy iff TF Vad. 


ProoF. The implication from right to left is easy, so we just establish the other 
direction. If 7 then the first form of Generalization yields [+ Vydy. But the 
above lemma implies [ F (Wy@j + Vx), so we conclude [ F Vag. 


Thus to show Vaz@ is deducible from some T in which x occurs free, we first 
choose y not occurring in ¢ and not occurring free in [ and then show [ F @y. 
Since we virtually always are considering only sets [' which have just finitely many 
free variables total, this choice of y is not a problem. 

Before considering formulas of the form —¢, we introduce the important notion 
of consistency and use it to characterize deducibility. 


DEFINITION 4.4. (1) The set T’ of formulas is inconsistent iff there is some 
6 € Fm¢ such that [+ 6 and I+ 76. (2) The set T is consistent iff [ is not 
inconsistent. 


We first note the following easy characterization of inconsistency. 
LEMMA 4.8. A set’ C Fmg is inconsistent iff [+ @ for every @ € Fme. 


PrRooF. The implication from right to left is clear. For the other direction, 
suppose [+ @ and T+ 76. For any ¢ € Fmg, (0 > (70 — ¢)) is a tautology, hence 
TF @ with two uses of MP. 


The following theorem enables us to reduce deducibility to (in-) consistency. 


THEOREM 4.9. Let T C Fme, @ € Fme. Then T + 6 iff f U {b} is not 
consistent. 


Proor. If ' ¢ then TU{¢} is inconsistent since both ¢ and 7¢ are deducible 
from it. If [ U {¢} is inconsistent then, by the preceding lemma, we see that in 
particular TU {-=¢} + ¢@ and so T+ (-¢ > @), by the Deduction Theorem. But 
((-¢ > ¢) > ¢) is a tautology, and so we conclude [| @. 


In particular we derive a method of showing the deducibility of formulas of the 
form 7¢@. 


COROLLARY 4.10. (Proof by Contradiction) TU {¢} is inconsistent iff TF 
ad. 


This may not actually be very useful, since showing I U {¢} is inconsistent 
is completely open-ended—what contradiction 6, = you should try to derive is 
unspecified. As a prcatical matter of showing the deducibility of 0 it is usually 
better to use one of the following, if at all possible. 


LEMMA 4.11. (1) PE ¢ iff PE 6. (2) PE A(O > W) iff Eo and -W. 
(3) TU{O} Fv TU {7g} F 76. 
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The proofs are immediate consequences of appropriate tautologies and are left 
to the reader. 

As an example of showing deducibility using the established rules, we show 
that + (daVyd > Vyda¢), that is F (AVaVyd > Vy7Vaend). By the Deduction 
Theorem it suffices to show ~Vaz7Vy¢ - Vy7Va7¢; by Generalization (y not being 
free in ~Va—Vy¢) it suffices to show -Va-Vy¢d + =Vaerd. By Lemma 2.5.9 part 3 
it suffices to show Va7¢@ + Va-Vyd. By Generalization (since x ¢ Fu(Vr7¢)) it 
suffices to show Vag F =Vy¢. Finally, by the corollary “Proof by Contradiction” 
(nothing else being applicable, it suffices to show [ = {Va-¢, Vy¢} is inconsistent. 
But this is now easy, since [ ng and TF ¢. 

The “Soundness” result from earlier in this section has the following form ap- 
plied to consistency. 


COROLLARY 4.12. (Soundness) Assume T C Fmz is satisfiable. Then T is 
consistent. 


PROOF. Suppose I is inconsistent. Then + Va = xx. So by Soundness we 
have . — Va = xa. Thus, if T is satisfiable on 2 then necessarily, 2 EF Van = ra— 
which is impossible. 


The Completeness Theorem proved in the next chapter will establish the con- 
verses of the two Soundness results, that is we will conclude the following equiva- 
lences. 


THEOREM 4.13. Let 1 C Fme. Then (1) for any é@€ Fmc, Tt 6 iff T 
(2) T is consistent iff T is satisfiable. 


l 


0; 


The importance of this result is that facts about deducibility and consistency 
can be translated into facts about logical consequence and satisfiability. The most 
important such general fact is the translation of the easy finiteness property of 
deductions. 


LemMA 4.14. (1) TF ¢ ifffo' @ for some finite p9 CT. (2) T is consistent 
iff every finite 9 CT is consistent. 


Both parts of the lemma are immediate from the fact that any specific deduction 
from I uses just finitely many formulas from [ and thus is a deduction from a finite 
subset of I. 

Using the Completeness Theorem the Finiteness Lemma becomes the highly 
important, and non-obvious, Compactness Theorem. 


THEOREM 4.15. Let l C Fme. Then (1) T E 6 iff To E @ for some finite 
To CT; (2) T is satisfiable iff every finite To CT is satisfiable. 


For the proof of Completeness we will need two further facts about deducibility, 
both of which concern constant symbols. Recall that we defined ¢7 as the result 
of replacing all occurrences of c in ¢ by the variable y. The resulting formula has 
no occurrences of c, and (¢%,)¥ = @ provided y does not occur in ¢. The content of 
the following lemma is that this substitution preserves deducibility from sets T° in 
which c does not occur. 


LEMMA 4.16. Let c be a constant symbol of Lnot occurring in any formula in 
T. Let $0,..-,¢n be a deduction from T and let w; = (b:)5 where y is a variable 
not occurring in any of do,-.--,¢n- Then wWo,..-,Wn ts also a deduction from T. 
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ProoF. If ¢; € T then vy; = ¢; since c does not occur in any formula in I. If ¢; 
follows by MP by @,;, ¢, then it is easily checked that 7, likewise follows by MP from 
Wj, Wx. It thus suffices to show that y; € A if d; € A. This is tedious-especially for 
tautologies—but not essentially difficult, so we leave it to the reader. 


Our first corollary of this is yet another form of Generalization. 


CoROLLARY 4.17. (Generalization on Constants) Assume c does not occur 
inT and @%. Then TF Vad. 


PROOF. Let ¢o0,...,¢n be a deduction of ¢% from T and let y be a variable 
not occurring in any of $0,.--,@n. Let Wi = (¢i)5- Then Yo,..-, Wn is a deduction 
from I, by the lemma, and hence from 


To = {¢i| i ET,i <n} = {Wil vi €T,i <n}. 
Thus [9 F (¢2)5. But (¢2); = ; since y does not occur in ¢¢. Further y does 


not occur (free) in any formula of 9, so the second form of Generalization yields 
To + Vad, and sol + Va. 


The second consequence of this lemma concerns the result of changing lan- 
guages. Suppose £ C L’,T C Fmz, ¢ € Fm¢. We then really have two different 
definitions of “¢ is deducible from I” according to whether the deduction consists 
only of formulas of £on whether formulas of £’ are allowed. Let us express these as 
Tre ¢,T L£'¢. Clearly, if Fe ¢ then fg d. The converse is much less clear. 
We are, however, able to prove this now provided that L’ — £ consists entirely of 
constant symbols. 


THEOREM 4.18. Assume LCL’ and L' — £ consists entirely of constant sym- 
bols. Let! C Fme. Then (1) for any d € Fmc, T ke 6 iffl ke @; (2) T 
is consistent with respect to £L-deductions iff T is consistent with respect to L'- 
deductions. 


ProoF. (1) Let ¢o,...,¢n be an £’-deduction from T of ¢. Let co,...,¢m list 
all constants from £’—£ appearing in this deduction-so this is an (LU{co,..., Cm }- 
deduction. Let w? = (:);° for each i = 0,..., where yo is a variable not occurring 
in any of ¢o9,..-,¢n. Then by the lemma w,..., Y° is a deduction from T consisting 
of formulas of LU {c1,...,Cm}. Since dn = ¢ € Fmg we have W? = dn, so this 
is still a deduction of ¢. Repeating this for c,,...,¢m we eventually arrive at a 
deduction from T of ¢ consisting just of formulas of L. 

(2) This follows immediately from (1). 


5. Theories and Their Models 


There are two different paradigms for doing mathematics. One is to study all 
structures in some class defined by certain properites. The other is to study some 
specific structure. An example of the first would be group theory, which investigates 
the class of all structures satisfying the group axioms. An example of the second 
would be real analysis, which studies the particular structure of the real numbers. 

Both of these paradigms have counterparts in logic. What is charateristic of 
the logical approach in both cases is that the properties used to define the class of 
structures, and the properties of the structures themselves, should be expressible 
in first order logic. To begin with we concentrate on the first paradigm. 

First some terminology. 
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DEFINITION 5.1. Let % C Sn¢. Then the set of consequences of © is 
Cne(X) = {0 € Sne| © — 6}. 


Note that if £ Cc £’ and & C Sng then Cn¢(H) C Cng(U). Nevertheless we 
will frequently omit the subscript Lif there is no chance of confusion. 


DEFINITION 5.2. Let & C Sn-. Then the class of models of ¥ is 
Modc(%) = {L-structures 2] 2 dS}. 


We note the following easy fact. 


LEMMA 5.1. Let 41,42 C Sng. Then Cn(1) = Cn(Xe) iff Mod(X\) = 
Mod(X2). 


We think of a set © of sentences as the axioms of a theory. Then Mod(=) is 
the calss of models of the theory, and C'n(X) is the set of theorems of the theory, 
that is the set of sentences true on all models of the theory. By the above lemma, 
two sets of sentences have the same models iff they have the same consequences. In 
this case we will consider them to both define the same theory. This is conveniently 
captured in the following definition. 


DEFINITION 5.3. (1) By a theory of Lis meant any set of sentences of Lof the 
form T = Cn¢(X), © C Sng. (2) If T is a theory of £then any set © C Sng such 
that T = Cnc(X) is called a set of axioms for T. 


LEMMA 5.2. Let T C Sng. Then T is a theory of Liff T = Cne(T). 


The largest thoery of Lis T = Sng. This theory has no models and can be 
axiomatized by the negation of any logically valid sentence of £, for example Vz7 = 
LL. 

The smallest theory of Lis T = {0 € Sn¢e| F- O@}. This theory is equal to 
Cnz(@) and every structure of Lis a model of it. 

In between these extremes are the theories of T which have models but which 
are not satisfied by every £-structure. One important kind of example is given by 
the (complete) theory of an £-structure. 


DEFINITION 5.4. Let 2 be an £-structure. Then the (complete) theory of 2 is 
ThA) = {a € Sng] AE co}. 


DEFINITION 5.5. Let T be a theory of £. Then T is complete iff T has a model 
and for every o € Sn¢ either 0 € T or -a0 ET. 


The following fact is easily verified. 


LEMMA 5.3. A set T C Sng is a complete theory of Liff T = Th(Q) for some 
L-structure I. In this case T = Th(A) for every AE T. 


The complete theory of 2 tells you everything about 2 that can be expressed by 
first order sentences of £. Having the same complete theory defines a very natural 
equivalence relation on the class of £-structures. 


DEFINITION 5.6. L£-structures 2 and % are elementarily equivalent written 
A= B iff Th(A) = Th(B). 


44 2. FIRST-ORDER LOGIC 


We will see later that elementarily equivalent structures may look very different— 
to begin with, they may have universes of different cardinalities. In fact, we will 
prove in Chapter 3 that whenver 2 is infinite (meaning A, the universe of 2, is 
infinite) then there is a S such that 2 = B and |A| < |B]. 

The natural question is how much “alike” must elementary equivalent struc- 
tures be? This is vague, but we will interpret this to mean what can we prove about 
the models of a complete theory? This will in fact be the central topic of much of 
Part B, in partiuclar of Chapter 5. 

Even more fundamental is the question of how we would show that two struc- 
tures 21,58 are elementarily equivalent. We won’t be able to prove directly that 
for every 6 € Sng we have 2 — o iff B & o. If for a given 2 we could explicitly 
determine (“write down”) a set © of axioms for Th(2l) then we could conclude that 
$B = 2A iff B — LX. But determining whether or not © axiomatizes a complete 
theory is of the same level of difficulty—we are not going to be able to prove directly 
that for every 9 € Sne we have either & — @ or © | 78 but not both. 

We will in fact develop some techniques for showing that a theory given ax- 
iomatically is complete, although they will be of restricted applicability. More 
importantly, we will develop techniques for showing that a theory—including one 
given in the form Th(2l)—will have models with certain properties. These tech- 
niques will not yield a complete description of the structures proved to exist, but 
they yield a great deal of information about the models of a theory. 

As a beginning step we introduce isomorphisms between £-structures and prove 
that isomorphic £-structures are elementarily equivalent. Roughly speaking, two 
structures are isomorphic provided there is a one-to-one correspondence between 
their universes which “translates” one structure into the other. 


DEFINITION 5.7. Given £-structures 2,8 a function A is an isomorphism of 
2 onto B, written h : A & B, iff h is a function mapping A one-to-one onto 
B such that (i) h(c*) = c® for all constants c € L, (ii) h(F™(a1,...,am)) = 
F®(h(a1),...,h(am)) for all m-ary function symbols F € £ and all a1,...,a@m € A, 
(iii) R™(a,,...,@m) holds iff R™(h(a,),...,h(a@m)) for all m-ary relation symbols 
REL and all a,...,am € A. 


The reader should note that this definition agrees with the familiar algebraic 
definition on algebraic structures like groups, rings, etc. Since isomorphic structures 
are “the same” except for the identity of the elements of their universes it is not 
surprising that they will be elementarily equivalent. In fact, we prove something 
stronger. 


THEOREM 5.4. Let 2,8 be £L-structures and assume h : A = B. Then 


for every &(a9,---;%n—1) € Fmg and for all ao,...,@n_1 € A we have Aa | 
(Cag:++++Can_1) Uf BBE O(co,,---,Cb,_,) where bj = h(a), i =0,...,n—1. 

PROOF. One first shows by induction on Tm, that for every t(xo,...,U%n—1) € 
Tmg and every do,.--,@n—1 € A, 


h(t™ (ao, wee ,4n—1)) _ ae (h(ao), sey h(an—1)). 


This argument is left to the reader. One next shows the equivalence in the statement 
of the theorem by induction on Fm;. We do two parts of the argument and leave 
the rest to the reader. 
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If ¢ is the atomic formula = t,t2, then the following are equivalent: 
A4 = P(Caos aac Cag a), 
t* (ao, seey Gn—1) = ts (ao, seey Qn—1); 
h(t? (ao, sey Gn—1)) = h(t3(ao, sey Qn—1)); 
a (h(ao), seey h(an—1)) = ty (h(a), seey h(an—1)), 
Bp [— (Cb; eee (i) 
The equivalence of the second and third lines follows since h is one-to-one, and the 
equivalence of the third and fourth lines follows from the preliminary lemma on 
Tme. 
Suppose $(%o,---,2n—1) = Vyw(ao,---,;%n—1,y) and, as inductive hypothe- 


sis, that the equivalence holds for w and for all ao,...,@n-1,a € A. Fixing 
ao,---;4n—1 € A the following are equivalent: 


Qa E @ (Gags eines (Caaea Ny 


Ma E W(Cay,-++)Can—1>Ca) for alla € A, 


Bet W(Chgxen-is 00g a5 Chey) for alla € A, 
Be E w(cbo,---+Cbn_1> Cb) for all b€ B, 
Bp E b(Cbo3+++3Cbn—1): 


The equivalence of the second and third lines follows from the inductive hypothesis, 
and the equivalence of the third and fourth lines follows since h maps A onto B. 


As usual we say 2% and 8 are isomorphic, written 2% = %, iff there is an 
isomorphism fh of 2 onto S. Further an automorphism of 2 is an isomorphism h 
of 2( onto itself. 


EXAMPLE 5.1. Let £be the language whose only non-logical symbol is a binary 
relation symbol R. Let 2 = (w,<) and B = (B,<) where B = {2k| k ew}. Then 
2 and % are isomorphic via the mapping h : A > B defined by h(k) = 2k for all 
k €w. All that needs to be checked is that h maps A one-to-one onto B and that 
R*(k,1) holds iff R®(h(k), h(1)) holds, that is k <1 iff 2k < 21. 


EXAMPLE 5.2. With Las in the previous example, et % = (Z,<). Then for 
every ko,lo € A there is an automorphism h of 2 such that h(ko) = lg. We leave 
the reader to check that h defined by h(k) = k + (lo — ko) works. 

Note that it follows, in this example, that for every (a) € Fmg and every 
ko, lo € A we have 


De (Ck) iff 4 & P(Cig)- 
It follows that either 6% = A or 6% = 0. 


EXAMPLE 5.3. Let Lbe the language whose only non-logical symbol is a con- 
stant symbol c. Let 2,8 be any two £-structures with |A| = |B|. Then & = B. 

Let Ag = A— {ce™}, By = B— {ec}. Then |Ao| = |Bol, so there is some 
one-to-one function ho mapping Ap onto Bo. Define h: A > B by h(a) = ho(a) 
for a € Ag and h(a) = c® otherwise. Then h: 4 = B. 
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EXAMPLE 5.4. Let Lhave as its only non-logical symbols the constants cp, for 
new. Let T = Cne({7 = Cacm| n < m € wh). Let A,B both be models of T 
with |A| = |B| > w. Then A= B. 

Let Ap = A — {c#| n € w} and let Bp = B— {c?| n € w}. Then |Ao| = 
|A| = |B| = |Bo| since A and B are uncountable. Thus there is some one-to-one ho 
mapping Ag onto Bo. Define h: A > B by h(a) = ho(a) if a € Ao, h(c*) = c®, 
all n € w. Then A is well-defined and one-to-one, since both 2,8 — T, hence 
h:AZS. 

The reader should check that this theory has exactly w many non-isomorphic 
countable models—one for each cardinality for Ao. 


The statement of the theorem above on isomorphisms does not say that 24 = 
%$p,. This usually wouldn’t even make sense since these would be structures for 
different languages. However if h : 2 = 8 then % can be expanded to an L£(A)- 
structure $* such that %4 = %* by defining (ca)® = h(a), and if we add the 
condition that B = {c®"| c € £L(A)} then this is equivalent to A & B. 


6. Exercises 


(1) Show directly from the definitions or produce a counterexample to each of the 
following. 
(a) & (arPx > VeQx) > Va(Px > Qzx) 
(b) — (Px > Qy) > (ArPx > AyQy) 
(2) Establish the following, using the rules from the notes. 
F Va(Px > Qy) > (ArPx > Qy) 


(3) (a) Let £2” = {R,d} where R is a binary relation symbol and d is an individual 
constant symbol, and let 2 = (Z,<,0). 
(i) Find an £-formula y(x) such that y* = {-1} 
(ii) Determine ~™ where w(z, y) is Vz(Rrz > =Rzy). 
(b) Let £™ = {F} where F is a binary function symbol. Let % = (w,+) and 
let B = (Q,+). Find a setence o of £ such that 2 o and BE -o. 
(4) Let £ be any language and let ® = {y, :n € w} C Fme. Assume that © is 
inconsistent. Prove that there is some n € w such that 


F (=99 V +++ V 7p) 


— 


(5) Establish the following: 


+ (daVyRey > SyRyy). 

(6) Prove Theorem 2.2.2 about unique readability of terms. 

(7) Let £™ = {R, E, P} where R is binary and EF and P are both unary relation 
symbols. Let 2 be the £-structure with A = w where R™ is <, E™ is the set of 
even numbers and P™ is the set of prime numbers. Give sentences of £ which 
‘naturally’ express the following facts about 2. 

(a) There is no largest prime number. 

(b) There is exactly one even prime number. 

(c) The smallest prime number is even. 

(d) The immediate successor of an even number is not even. 

(8) Let £™ = {R} where R is a binary relation symbol, and let % = (w,<). 
Determine y™ where: 
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(a) y is Vy(Ryz > 7~Ryz) 
(b) vy is Vz(Rz«x V Ryz) 


Sly 


c 
> ={aceR:0<a} 
% 
3 
% 


= {(a1, a2) €R?: aa< az} 

4 = {(a1, a2) € R*: |a1| = |aa|} 

(10) Let 2”! = {R} where R is a binary relation symbol, and let %, = (w, <), %2 = 
(Z, <), 3 = (Q,<) and A, = (ww {O},|). For each 1 < i<j <4 finda 
sentence o;,; of £ such that 2; — o;,; but 2; = 70;,;. 

(11) Fix a sequence {6;}ic., of sentences of £. For any sentence o of sentential logic 
let o* be the result of replacing all occurences of $; in o by 6; for each 7 € w. 
(a) Give a definition by recursion of o*. 

(b) Define, for each £-structure 2, a truth assignment hy such that for every 
o of sentential logic we have hy - o iff 2 o*. (Note that it follows that 
o* is valid whenever o is a tautology.) 

(12) Given ©1, U2 C Sng define &* = {0 € Sn¢ : Uy | A and Xp — 6}. Prove that 

for every £-structure 2, 2 - U* iff either A “1 or WE No. 


CHAPTER 3 


The Completeness Theorem 


0. Introduction 


In this chapter we prove the most fundamental result of first order logic, the 
Completeness Theorem. The two forms of Soundness, from Chapter 2, give one 
direction of the biconditionals, and thus we need to establish the following. 


THEOREM 0.1. Let [ C Fmg. Then: (1) for any @ € Fmg, ifT — ¢ then 
TF ¢; (2) if T is consistent then T is satisfiable. 


Further, due to the equivalences + ¢ iff [ U {76} is not satisfiable, it suffices 
to establish (2) above. As we will argue in more detail later, the full version of (2) 
for sets [’ of formulas with free variables follows from the version for sentences. We 
therefore concentrate on establishing the following Model Existence Theorem. 


THEOREM 0.2. Every consistent set 4 of sentences of Chas a model. 


In our proof of this result we will first define a special sort of sets of sentences 
called Henkin sets—these sets will look roughly like sets of the form Th(2l4). We will 
show that Henkin sets determine structures which are their models, in essentially 
the same way in which we could recover 24 from Th(2l4). This will be done in the 
next section. In Section 3.3, we will show that if 4 is any consistent set of sentences 
of £, then © CT for some Henkin set T of sentences of £’, where L’ is a language 
obtained from Lby adding new constant symbols. This will finish the proof of the 
Completeness Theorem. 

In Section 4, we derive two of the most important consequences of the Com- 
pleteness Theorem. These are the Compactness Theorem and Lowenheim-Skolem 
Theorem. We then give several applications of these results to the study of theories 
and their models, as initiated in the preceding chapter. 


1. Henkin Sets and Their Models 


How could we use a set of sentences to define a structure which will be a model 
of the set? In defining a structure you need to specify the elements of the (universe 
of the) structure and specify the interpretations of the non-logical symbols of the 
language on the universe. For this information to be provided by a set of sentences, 
the language should contain constants for all elements of the structure and contain 
all the sentences with these constants true on the structure. Thus it should be 
roughly like Th(2_4) in the language L(A). 

The trick here is to decide on the necessary properties the set of sentences 
should have without knowing 2 to begin with. The following definitions collect all 
the properties we will need. 


DEFINITION 1.1. Let [TC Sng. 
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a) [is a complete set of sentences of Liff 

i) for every sentence 0 of £70 €T iff 6 €T, and 

ii) for all ¢,w € Sng, (6 > Vv) ET iff either ->¢d ET or WET. 

b) T’ has witnesses iff for every (2) € Fmg we have Va¢ € T iff d(c) € T for 
all constants c of L. 

c) T’ respects equality iff the following hold for all closed terms 

BE Pinkston Piensit OL 

i) =tteT, 

ii) if =tl’ eT thn =tteT, 

iii) if= tito T and = tots T then = tits E T, 

iv) if =¢t) €T for alli=1,...,m then = Ft,...t,Ft,...t(, €T for every 
m-ary function symbol F of ZL, 

v) if=¢ ti €T for alli=1,...,mandif Rt, ...t, holds where R is an m-ary 
relation symbol of Cthen Rt’,...t),, ET, 

vi) there is some constant c with =tc ET. 


Note that we continue in this chapter to assume that all sentences are written 
using just the connectives =,—> and the quantifier V. A set [ can be a complete set 
of sentences without being a complete theory, since it need not have a model. The 
additional properties are needed to guarantee that the set has a model. 

If 2 is any £-structure and we define T = Th(24) then T is a complete set 
of £(A)-sentences which has witnesses and respects equality. Our first goal is to 
see that every complete set of sentences with witnesses which respects equality 
determines a structure which is a model of the set. 


DEFINITION 1.2. Let I C Sn¢ and let 2 be an L-structure. Then 2 is a 
canonical structure determined by [ iff the following hold: 
i) a= {c™| cis a constant of L}, 
ii) for each m-ary relation symbol R of Land all c1,...,¢m € L, 
R* (ct, ...,c%) holds iff Rey ...¢m €T., 
iii) for all c,,c2 constants in L, ei = oy iff = cyco ET, 
iv) for each m-ary function symbol F of Land all c1,...,¢m,d € L, F™(c#,... 
d™ iff = Fc,...c,d €T. 


Note that a given T may not determine any canonical structure, but any two 
canonical structures determined by [I are isomorphic. Thus we will speak of the 
canonical structure for I. 24 is the canonical structure determined by Th(2l,). 


THEOREM 1.1. Let I be a complete set of sentences of Lwhich has witnesses 
and respects equality. Then T determines a canonical structure. 


ProoF. Let C be the set of constant symbols in £. We define the relation ~p 
on C by c xp d iff = cd € T. Then ~pf is an equivalence relation on C, since [ 
respects equality. 

For c € C we define c/~p= {dE C| cE Ch. 

We now proceed to define the structure 2%. Let A be any set such that |A| = |{c/~r 
| ce C}I, and let h: {c/~r | ce C} > A be one-to-one and onto. 

For any constant c € £ (hence in C), define c™ by c* = h(c/~r). 

For any m-ary relation symbol R of Land any c1,...,¢m € C, define R™ by saying 
R*™(c#,...,c%) holds iff Re: ...¢m ET. 

For any m-ary function symbol F of Land any c1,...,¢m,d € C define F* by 
PUG age ad” it = Fe the. 
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Then R™, F*™ are well-defined since I respects equality, and also c?! = c3! iff 
=c Co € I. Thus & is a canonical structure determined by I, as desired. 


THEOREM 1.2. Let T be a complete set of sentences of Lwhich has witnesses 
and respects equality. Let 2 be the canonical structure determined by T. Then 
AET. 


PROOF. We first must show that for every closed term t of Land every constant 
symbol ¢ of L, if = ct € T then t™ = c*. We leave this proof, by induction on Ih(t) 
and using the assumption that I respects equality, to the reader. 

We now show, by induction on /h(0) for 6 € Sng, that 1 6 iff@ eT. 

As inductive hypothesis suppose this equivalence holds for all 6 € Sng with 
Ih(@) <n. Now consider @ with lh(@) =n. There are several cases. 

If 6 is = tity for closed terms t;,t2 then (since I respects equality) there are 
constants c;,c2 € £ such that = cit,,= cotg € T. By the preliminary result on 
terms, t?! = c?! and ts! = c3'. Since T’ respects equality we know = cic2 € I, hence 
c# = cs by the definition of canonical structure. Therefore t?! = t3!, hence 2 | 0. 

The argument is similar when 6 is Rt, ...t,, and therefore left to the reader. 

The cases in which 6 = 7¢ or 6 = (¢ > w) are easy using the hypothesis that 
T is a complete set of sentences. 

We conclude by considering the case 6 = Vx¢(x). Then, since 2 is a canonical 
structure we have 2 — 0 iff 2 — ¢(c) for all ce L. Now 


Ih(d(c)) = 1h(@) < 1A(@) 


so the inductive hypothesis holds for every ¢(c). Therefore we see & — 6 iff d(c) ET 
for all c € £. But T has witnesses, so ¢(c) € T for all c € £L iff Vad € T. Thus 
2 6 iff 6 © T, which finishes the proof. 


Note that we have nowhere assumed that the set I’ is consistent, although it 
follows from having a model. When we add consistency, the list of other properties 
T must have for the preceding two theorems to hold can be considerably shortened. 


DEFINITION 1.3. Let ! C Sng. Then I is a Henkin set of sentences of Liff the 
following hold: 

(i) Tis consistent, 

(ii) for every 0 € Sng either 0 €T or 70 ET, 

(iii) for every formula (x) of Lif ~Vr¢d(x) € T then =¢(c) € T for some c € L. 


LEMMA 1.3. Let T be a Henkin set of sentences of £L. Then for every @ € Sng, 
eEelifflk ¢. 


ProoF. From left to right is clear. For the other direction, if ¢ ¢ T then 
a¢ €T (by condition (ii) in the definition of Henkin sets), so / =¢@ and therefore 
.'¥ ¢ by the consistency of I. 


We thus establish the following theorem. 


THEOREM 1.4. Let T C Sng. Then T is a Henkin set iff [ is complete, has 
witnesses, and respects equality. 


ProoF. If I is complete, has witnesses and respects equaltiy then we have 
shown that [ has a model, so [I is consistent and therefore a Henkin set. 
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For the other direction, assume [ is a Henkin set. Then IT is easily shown to 
be complete, using the lemma. Similarly [ is easily shown to have witnesses, using 
the lemma and the fact that 


TF (Vae(2) > o(c) 


for all constants c of £. Further, I is easily seen to respect equality, using the 
lemma and noting (for condition (vi)) that k ~Va5 = ta hence -Va7 = ta ET, 
and so =tc €T for some c. 


To summarize, we have shown that every Henkin set of sentences has a model, 
which is in fact the canonical structure it determines. In the next section we show 
that consistent sets of sentences can be extended to Henkin sets of sentences—in 
some larger language. 


2. Constructing Henkin Sets 


The work in the preceding section reduces the Model Existence Theorem at the 
beginning of section 3.1, to the following, purely syntactical result. 


THEOREM 2.1. Let U be a consistent set of sentences of £. Then there is some 
L' DL and some Henkin set T of sentences of L’ such that 5 CT. 


It is clear that we will normally be forced to have £L 4 L’ in this theorem 
(for example, may not have any constant symbols). On the other hand, we will 
need to guarantee that the set © we begin with remains consistent with respect to 
£'-deductions. Thus we will need to have that (£’ — £) consists just of constant 
symbols. 

Given L£, the question is: how many new constant symbols must we add to Lto 
obtain a language L’ such that the above extension result can be established? The 
answer—which is easier to justify after the fact—is that |£| new constants will suffice. 

We give the argument in detail for the case in which Lis countable, after which 
we indicate how the general case could be proved. 

So, fix a countable language £. Let C' be a countably infinite set of individual 
constant symbols not in Land let L’ = LUC. Then L’ is also countable thus in 
particular |Sinc/| = w, so we may list the sentences of L’ as Sng = {o,| n € w}. 

Now let © be a consistent set of sentences of £. Then © remains consistent 
with respect to £’-deductions, as established by Theorem 2.4.5. We wish to define 
a set I of sentences of £’ which is a Henkin set and contains ©. We will do this by 
defining, by recursion on w, a chain {[,}ne., of subsets of Sng, whose union is the 
desired Henkin set. This chain will have the following properties: [9 = % and, for 
each n € w: 

(On) T, © Pr4i, Pn41 is consistent, and (T,41 —T,) is finite; 

(1n) either o, € Tn41 or mon € Pn; 

(2n) if o, = Vx¢(x) for some ¢(x) and if a0, € T,41 then a¢(c) € T,41 for 
some cE C. 

Assuming {Pn }new is such a chain we show that T = U,,¢,, Pn is a Henkin set 
of sentences of L’. 

We first show I is consistent. If not then some finite I’ C T is inconsistent by 
the Finiteness Lemma. But then I’ CT,41 for some n € w and so [n41 would be 
inconsistent, contradicting (On). 

Next, let 9 € Sng. Then 6 = a, for some n € w and so, by (1n), either 
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8=o0n €Pn4i CT or 

46 = s0y ET nai CF. 

Finally, if (2) € Fmg and if sVrd(x) € T then Va¢(x) = oy for some n € w 
and necessarily so, € Tyn41 (since on € Pn41 CT would contradict the consistency 
of T), and so, by (2n), -¢(c) € Tn4i CT for some constant c. 

We now show how to construct such a chain {T,}new by recursion. Suppose 
we have U=Tp CT, C-:- CT, satisfying (0i), (1i), (2i) for alli <n. Then TI, is 
consistent and |[,, — X| < w. We show how to define T',41 so that (On), (1n), (2n) 
hold. We obtain T’,41 in two steps. The first will guarantee (1n) and the second 
will guarantee (2n). To ensure (On) we will only need to preserve consistency in 
performing these steps. 

We define [,, 44 as follows: 
=T,, U{on} if, U {on} is consistent; 

n¢h =TnU {79} if Pn U {on} is inconsistent. 

We claim that in either case I’, , 1 is consistent. This is clear in the first case. If, 
on the other hand, T,,U{o,,} is inconsistent then I, / sa, and so heres =T,,U{on} 
is consistent because I’,, is consistent. 

From Tas we define [,,,1 as follows [,41 = Paes if either a, € Veet Or On 


Tr 


n+ 


NIH 


a 


does not have the form Vxr¢; Pn41 =P ,41 U {7G(c)} where c € C' does not occur 
in any sentence in ere if o, = Ved(x) and 70, € ree We must show [41 is 
consistent in the second case. If I’,,, 1 U {79(c)} is inconsistent then I’, 1 F @(c). 
But c does not occur in I’,,,1 nor in g(x), and so Generalization on Constants 
yields T,,,1 | Va@(x). But this contradicts the consistency of T’,,, 1. Finally note 
that there will be constants in C’ not occurring in I, 1 since no constants in C’ 
occur in &, |P,41 — &| <w, and C is infinite. 

This completes the proof of the theorem stated at the beginning of this sec- 
tion for countable languages £. The same outline may be used for uncountable 
languages, but now transfinite recursion is needed. 

Suppose |£| = « and let £L’ = LUC where C is a set of new individual constant 
symbols, |C| = «. Then |£’| = & hence |Sng| = & and so Sng’ can be listed as 
Sno = {oe| € € k}. We define, by recursion on «, a chain {['e}ec,, of subsets of 
Sne so that & =To and for all € € & we have 

(0€) Te CT e41, Peqi is consistent, and (T¢41 — Te) is finite; 

(1€) either og € PE +1 on mo €T e413 

(2€) if o¢ = Vae(x) and m0¢ € T'e41 then -¢(c) € T¢41 for some c € C; 

and further if € is a limit ordinal then Te = U, 2 Tv. 

The construction proceeds exactly as before noting, for (2€), that is € € « and 
(Ov) holds for all v < € then |[~ — 4] < & = |C| and hence there will be constants 
in C not occurring in I’, U {70¢}. 

Since a Henkin set [ completely determines a canonical structure up to isomor- 
phism, we can control some properties of the canonical structure by constructing a 
Henkin set with specific properties. We will exploit this later, especially in deriving 
the Omitting Types Theorem. 

Note that if © is a consistent set of sentences of 2then our proof yields an L- 
structure which is a model of }—namely the reduct to Lof the canonical £’-structure 
determined by the Henkin set I of £’-sentences we construct containing ©». 
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3. Consequences of the Completeness Theorem 


We very seldom will use the Completeness Theorem directly, thus we will not 
prove that ta set © of sentences has a model by showing that it is consistent. Instead 
we will derive two purely semantic (or, model theoretic) consequences which will 
give us virtually everything we will need from Completeness, for the purposes of 
model theory. (A third consequence will be given much later in the context of 
decidability and the decision problem). 

The first of these consequences is the Compactness Theorem, a direct transla- 
tion of the Finiteness Lemma. We will argue for the full version later. Since we 
have established Completeness for sets of sentences we can at this point conclude 
the following version. 


THEOREM 3.1. (Compactness) Let % C Snr. 
(1) For any @ € Fmc, UE ¢ iff Xo E ob for some finite Xo CU. 
(2) © has a model iff every finite Xo CX has a model. 


To allow formulas ¢ in part (1) we use the fact that if d = d(xo,...,2@n) then 
SE @¢ iff & & Vao--- Vand. 

The force of the Compactness Theorem in form (2), as we will see in many 
examples, is that you may be able to produce a model 2s, of each finite Mo C U 
without knowing how to define a model of the entire set ¥. 

The second consequence is actually a consequence of our proof of the Com- 
pleteness Theorem using canonical structures determined by Henkin sets. 


THEOREM 3.2. (L6wenheim-Skolem) Let « = |L| and assume that & C Sne 
has a model. Then & has a model 1 with |A| < K. 


PROOF. Since © has a model, and is therefore consistent, our proof of the 
Completeness Theorem produced an L£-structure 2 which is a model of © and 
which is the reduct to Lof the canonical structure 2’ determined by a Henkin set 
of sentences of some L’ where |L’| = «. But 


A= A! ={c™|ce€L’ is a constant}. 
Therefore |A| < |L’| = k. 


As a first application of the L6wenheim-Skolem Theorem note that if 8 is any 
L£-structure, |C| = «, then there is some £-structure 2 with |A] < « such that 
2 = B. This is immediate by considering © = Th(B). The reader should consider 
what such a structure 2% would be if, for example, B® is (R,<) or (R,<,+4,-)- 
both structures for countable languages, hence elementarily equivalent to countable 
structures. 

The Compactness Theorem is one of the most important tools we have in model 
theory. We give here some easy, but typical, examples of its use. 

To begin with the Compactness Theorem, in its second form stated above, 
is a model existence theorem-it asserts the existence of a model of %, given the 
existence of models of every finite subset of &. We will frequently use it to prove 
the existence of structures with certain specified properties. To do so we attempt 
to express the properties the structure should have by a set of sentences, and then 
use Compactness (or, later, other similar results) to show the set of sentences has 
a model. 

As a first example of this procedure we prove the following result. 
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THEOREM 3.3. Let % C Sn¢ and assume that for everyn € w, & has a model 
A, with |A,| >n. Then X has an infinite model. 


PrRooF. Our conclusion asks for an £-structure 2% with two properties: 

AE U and 

|A| is infinite. 
Recall that we know a set ©,, of sentences (with no non-logical symbols) such for 
every £-structure 2, |A| is infinite iff 2 ©,,. Thus what we want to prove is 
precisely that (XU 0,,) has a model. By compactness it suffices to show that every 
finite Xo C NU, has a model. Let Ho C NUO,, be finite. Then Ho C NU {O,| 2 < 
k < n} for some n € w. But the hypotheses to this theorem provide a model 2, of 
» with at least n elements. Therefore 2, F © U {6,| 2<k <n} and so 2, - No. 
Thus © U ©,, has a model, as desired. 


This theorem enables us to answer some questions we raised earlier. 


COROLLARY 3.4. There is no set Of of sentences (of any £L) whose models 
are precisely the finite £L-structures. In particular, there is no sentence 6, whose 
models are precisely the infinite £-structures. 


Proor. Such a Oy would have arbitrarily large finite models, hence it would 
have infinite models by the theorem—a contradiction. 


We give now a second proof of the above theorem, indicating another technique— 
changing languages. 


PRooF. We show another way to express “A is infinite” with a set of sentences. 
Let cn, n € w be distinct constant symbols not in £. Let L’ = LU {c,| n € w} 
and let 3! = {4 = cpcm| n <m €w}. Then an L£-structure is infinite iff it can be 
expanded to an £’-structure which is a model of ©’. Thus to show that the set © 
of sentences of £has an infinite model it suffices to show that (XU X’) has a model, 
since the reduct to Cof such a model will be as desired. By compactness it suffices 
to show that every finite Xy> C © UX’ has a model. If Ho C UU” is finite then 
Yo C VU{A = Cncm| n <m < k} for some k € w. Consider 2,41-the model of © 
we are given which has at least k + 1 elements. We show how to expand it to an 
L'-structure which will be a model of Ng. Pick 0, +++, Ak € A, all different. Define 


; 2 
2). to be the expansion of 2,41 such that cn *** 


Qi, , is as desired. 


= ag for alln > k. Then clearly 


The advantage of this second proof is that it generalizes to yield models of 
cardinality > « for every «. All you need to do is add & new constants and consider 
the set of sentences asserting they are all different. By combining this with the 
Léwenheim-Skolem Theorem we obtain an important stregthening of that result. 


THEOREM 3.5. Let |£| = « and assume that % C Sn¢ has an infinite model. 
Let X be a cardinal with K <A. Then X has a model of cardinality equal to x. 


Proor. Let C be a set of constant symbols not in £, |C| = A. Let L’ = LUC. 
Note that |£’| = » since k < A. Let &’ = {4 = cdl c,d € C,c F¥ d} and define 
SS We, 

Then, as described above, 4* has a model since every finite Ny C &* has a 
model. Further, every model of 4* has cardinality > ». Since |£’| = X the original 
Lowenheim-Skolem result implies that =* has a model 2’ of cardinality < », and 
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so of cardinality exactly = A due to the property of &* mentioned previously. 
Therefore 2% = 2’ [ £ is an L-structure which is a model of © and which has 
cardinality exactly = X. 


COROLLARY 3.6. Let 2 be any infinite structure. Then there is some 8 such 
that 2= B but |A| ~ |B, in particular, AF B. 


Proor. Let © = TA(Q) and let \ be such that A > |£| and \ # |A]. Then any 
model % of ¥ of cardinality equal to X suffices. 


Given an infinite 21 can we find some $ = A such that 24 B and |A| = |B? 
Clearly this is not always possible-for example if £"/= and Y is any L-structure 
then 2 = B for every £-structure B with |A| = |B|. We will see less trivial 
examples of this phenomenon later. 

As a final application of Compactness in this section we show that there are 
structures 2, & for a countable language sucdh that |A| = |B| =w but 14 B. 


THEOREM 3.7. Let Lbe the language whose only non-logical symbol is a binary 
relation symbol R. Let A be the L-structure (w,<). Then there is some £L-structure 
$B, |B) =w, such that l=B but AFB. 


Proor. Asa model existence result, this asserts the existence of an £-structure 
% having the following three properties: B = 2%, 2A 4 B, and |B] = w. The first 
property is expressed by the first order condition 8 - Th(2). Provided that we 
can guarantee the second property by some set of sentences in a countable language, 
then the last can be guaranteed using the Lowenheim-Skolem theorem. 

To assert, with first order sentences, that 24 B we first recall that, for each 
n € w, there are £-formulas ¢,(x) such that ¢% = {n}. Thus to be non-isomorphic 
to 2l means having an element satisfying no ¢@,. 

So let £L’ = LU {d} and let ©’ = {7¢,(d)| n € w}. Then if 8’ E X’ we must 
have 


B= B' LAA 
since if h : B = 2 then X24 E 7d,(ca«) for all n € w where a* = h(d®’). Since no 
a* € A has this property we must have 8 $ 2. 
Thus to show that some countable S = 2 is not isomorphic to 2 it suffices to 
show (Th(2l) US’) has a model. 
By Compactness it suffices to show that every finite subset of (Th(2l) UX’) has 
a model. Let Xo C Th(2l) UX’ be finite. Then 


Do C TAA) U{9n(d)| n < k} 


for some k € w. We show how to expand 2 to an L£’-structure which is a model of 
Xo. Define 2’ by 2’ | L = A and d™’ = k. Then clearly 


W & Th(A) U {>bn(d)| rn < k} 


as desired. 


A careful examination of this proof will reveal that it applies much more gen- 
erally than these particular circumstances. We will give a very general version in 
the next chapter. 

Note that this proof does not provide any description of what such a 8 might 
look like. The reader should attempt to decide what a good candidate for such a 
structure would be. 
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4. Completeness Categoricity, Quantifier Elimination 


In the preceding section we have proved that for any infinite £-structure 2 there 
are £-structures 8 such that 2% = B but A 4 B. In particular, if W is (w,<) there 
is a countable S$ with this property. These proofs do not give a construction of 8 
or describe 8 completely. The question thus remains of whether we can exhibit 
such a 8 explicitly, at least given a specific . 

This leads us to the following, even more fundamental problem: 

given non-isomorphic £-structures 2,8 how can we show that 2 = 8? 

In this section we give a simple method that can be applied in some cases. 
Although not widely applicable, it will give us several important examples. We will 
improve the method later, but the problem in general is intractibly difficult. For 
example, let Cbe the language for groups, let 2 be (the £-structure which is) a free 
group on 2 generators, and let $8 be a free group on 3 generators. It has long been 
conjectured that 2l and S are elementarily equivalent, but no one has been able to 
decide the question. 

We actually will consider a variant on this problem, namely: 

given © C Sn how can we show that T = Cn(X) is complete? 

This will apply to the preceding problem in the case in which © is an explicitly 
given set of sentences, and the method would be to show both 2 and % are models 
of . 

Certainly if any two models of } are isomorphic, then T = C'n() is complete. 
But this will never be the case unless © only has models of a fixed finite cardinality. 
The method we give here, and will imporve on later, will only require that any two 
models of 4, satisfying some other requirements, be isomorphic. These methods 
will apply to (some) theories with infinite models. 


DEFINITION 4.1. Let « be a cardinal number and let T’ be a theory of some 
language £. Then T is K-categorical iff T has a model of cardinality « and any two 
models of T of cardinality « are isomorphic. 


THEOREM 4.1. (Lo8- Vaught Test) Let T be a theory in a language Land let 
k be a cardinal number, |£L| < «. Assume that T has no finite models and T is 
K-categorical. Then T is complete. 


ProoF. If T, satisfying the hypotheses, is not complete then there must be 
a sentence o of h such that both T U {o} and T U {70} have models. In fact, 
they must both have infinite models, so by the general form of the Lowenheim- 
Skolem Theorem they both have models of cardinality equal to &. But since these 
models must be non-elementarily equivalent this contradicts the hypothesis of K- 
categoricity. 


We give, without much detail, several applications of this test. 

(1) L’= 0, T = Cn(0). Then T is x-categorical for all k > 1, but T is not 
complete since it has finite models. 

(2) LY= 0, T = Cn(O,,). Then T is k-categorical for all infinite cardinals K 
and has no finite models, hence T is complete. 

(3) £“= {P} where P is a unary relation symbol. Let © be the set of all 
sentences asserting “P has at least n elements” and “=P has at least n elements” 
for all positive n € w. Then an L-structure & is a model of T = C'n(X) iff |P*| > w 
and |-P*| > w. It follows that T has no finite models and is w-categorical, hence 
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is complete; but 7’ is not «-categorical for any uncountable «. For example, T has 
exactly three non-isomorphic models of cardinality wy. 


(4) LY= {e,| n € w}, T = Cn(X) where © = {3 = cjcj| i # j}. For any 


L-structure & we define Ag = {c™|n € w} and Ay = A— Ap. We claim that, for 
models 2 and B of T, 2% = B iff |Ai| = |B,|. Supposing that 2,8 —& T and 
|A;| = |Bi|, we choose some h; mapping A, one-to-one onto B,. We then define h 
on A by h(c2) = c> for all n € w, h(a) = hi(a) for alla € Ay. Thanh: % 28 
since both structures are models of T. Now, for a model 2 of TJ, A, can have 
any cardinality, finite (including 0) or infinite, but |Ao| = w. It follows that T is 
k-categorical for all & > w but not w-categorical. 


5. Exercises 


(1) Let T be a theory of £ and o,, € Sng for alln € w. Assume that TE (on41 > 


On) for all n € w, but that T KF (a, 4 on41) for all n € w. Prove that there is 
some model 2 of T such that 2% op for every n € w. 

Give an example of a language £ (with just finitely many non-logical symbols) 
and some finite set © C Sng such that the theory T = Cn() has models and 
all models of T are infinite. 

Let £2”! = {R} where R is a binary relation symbol and let 2% = (w,<). Let B 
be such that 21 = 8 but 2 is not isomorphic to %8. Prove that there is some 
infinite sequence {b, ne. of elements of B which is strictly decreasing, that is 
R® (bn41,6n) holds for all n € w. 

Let TJ, and T> be theories of £, and assume that there is no sentence @ of £L 
such that T; E 6 and Tz — 70. Prove that (JT, UT2) has a model. [Warning: 
(T, UT2) need not be a theory.] 

Let T, and T> be theories of £. Assume that for every £-structure 2 we have 


Prove that there is some sentence o of £ such that T; = Cn(c). 


Part 2 


Model Theory 


CHAPTER 4 


Some Methods in Model Theory 


0. Introduction 


In model theory one investigates the classes of models of (first order) theories. 
The questions largely concern the “variety” of possible models of a theory, and how 
different models of a theory are related. The most basic and important results of 
model theory yield the existence of models of a theory with specified properties. 
The Compactness and Lowenheim-Skolem theorems are both such results, and form 
the basis for many important applications, some of which were covered in Chapter 
3, section 4. 

In sections 1 and 2 of this chapter we present further model-existence results 
and some of their applications, which are pursued much further in the next chapter. 
Of especial importance is the concept of a type (of elements in a model) and the 
theorem on omitting types (in a model of a theory). Two natural relations of 
inclusion between models are introduced in section 3 and the method of chains 
(especially under elementary inclusion) to construct models is introduced. 

In section 4, we introduce the “back-and-forth” method, widely used to show 
two models are isomorphic. This method will also be used in the next chapter. 


1. Realizing and Omitting Types 


If two models are non-isomorphic, how might one recognize the fact? Certainly 
if they are not elementarily equivalent or if they have different cardinalities they 
are non-isomorphic. If they are elementarily equivalent of the same cardinality, 
however, the question becomes harder, particularly for countable models. We have 
seen two models be non-isomorphic since there was a set {¢;(x)| 7 € I} of formulas 
such that in one model there was an element simultaneously satisfying every «; (2) 
while there was no such element in the other model. In the terminology we are 
about to introduce we could conclude the models were non-isomorphic since there 
was a type which was realized in one model, but omitted in the other model. 


DEFINITION 1.1. (a) A type, in the variables xo,..., 2, is a set I’ of formulas, 
with at most the variables 2o,...,@%, free. We write I'(xo,...,%n) to mean that 
only 20,.--,%n occur free in T. 


(b) T'(ao,...,@n) is realized in 2 by ao,...,@n € A iff A, E O(G,...,G,) for 
all ¢(ao,..-,@n) € T. T is realized in & if it is realized in 2 by some agp,...,@n € A. 

(c) I(ao,.--,2n) is omitted in 2 iff [ is not realized in 2, that is, for all 
ao,---,;@n € A there is some ¢ € T such that 24 / 7d(G,...,Gn). 


The connection of types with isomorphisms is contained in this easy result. 


PROPOSITION 1.1. Assume h is an isomorphism of 2 onto B. If ao,...,@n 
realize the type T'(xo,...,%n) in A then h(ag),...,h(an) realize T in B. 
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PROOF. This is immediate from the fact that 
Qa r= (ao, see Gn) iff Bp eat o(h(ao), sey h(an)). 


COROLLARY 1.2. If 22 B then A and B realize precisely the same types. 


Thus, showing that two models realize different collections of types is one way 
of showing they are non-isomorphic, even if they are elementarily equivalent and 
of the same cardinality. One can ask if this will always work. That is: if 2 = B, 
|A| = |B| = w and 2 and 8% realize precisely the same types, must 2 =~ B? 

More generally (or, rather, more vaguely) we want to use types to investigate 
the countable models of (complete) theories—how they help us distinguish between 
models, etc. For example, we know that Th((w,<,+,-,0,1)) has at least 2 non- 
isomorphic countable models. Precisely how many non-isomorphic countable mod- 
els does this theory have? We will be able to answer this question exactly by looking 
at the types realized in models of this theory. 

The two questions we want to attack immediately are: when does a theory T 
have a model realizing (omitting) a type [’? 

If T(ao,..., Un) is finite, say T = {G0(),.--, Ox(Z)}, then obviously 2 realizes 
T iff @ Aay-- 3tn( Po A+++A dn). Thus T has a model realizing this finite T iff 
T U {S29 +++ San (do A+++ A bn)} is consistent. We will normally say simply that 
T U{¢o,.--, on} ts ery or {¢0..-, dn} is consistent with T. 

If ['(xo,..-,2%n) is an infinite set of formulas we cannot write down a sentence 
saying that [ is realized. At the very least, if I’ is realized on some model of T then 
every finite [9 C T is consistent with 7. We will also say [ is consistent with T, 
or T is finitely satisfiable in T, to mean that every finite [9 C T is consistent with 
T, or equivalently, every finite [9 C I is realized in some model of T. A simple 
compactness argument yields the following: 


THEOREM 1.3. T has a model realizing ['(xo,...,%n) iff T is finitely satisfiable 
inT. 


PROOF. We only need to show the implication from right to left. First let 
Li =LU{c,..-,Cn} where co,...,Cpn are new constants. Let 
Y=TU {G(co,---,en)| pe Pe 
If 2’ E ¥ then 2’ | £L is a model of T realizing T. We show © has a model by 
compactness. Let Xo C & be finite. Then 
do c PUTO Ci tee) | gE To} 
for some finite [g CT. By assumption, T has some model 2 in which Ip is realized, 


say by ao,---,@n € A. Let 2’ be the expansion of 2 to L’ such that cf =a,;. Then 
QU’ -— do- 


Warning: This result does not say that a (fixed) model 2 realizes provided 
every finite subset of I is realized by 2. This is certainly false, as easy examples 
show. 

As a consequence of the L6wenheim-Skolem Theorem applied to the set © in 
the preceding proof we obtain: 


COROLLARY 1.4. If T has an infinite model realizing T, then T has a model of 
cardinality « realizing T for each k > |LI. 
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As an example of the use of types we have the following: 


EXAMPLE 1.1. Let T = Th((Z, <)) and let I(x, y) = {on(x, y)| n € w}, where 
on(x,y) says “x < y and there are at least n elements between x and y.” Then [ 
is omitted in (Z, <) but every finite [9 C T is realized in (Z, <), hence T is realized 
in some countable model 2 of T, which must then not be isomorphic to (Z, <). 


Let us turn to the question of when a theory T’ has a model omitting [. The 
way to approach this question is to look at the negation and ask instead when every 
model of T realizes [. One will then be led to the following concept: 


DEFINITION 1.2. T locally realizes ['(ao,...,@,) iff there is some formula 
O(x0,---,2n) consistent with T such that 


T & Va9---Vin(@ > ¢) 
for all d ET. 
We can easily see: 


PROPOSITION 1.5. Assume T is complete. If T locally realizes [ then every 
model of T realizes I. 


ProoF. Let 2 = T. Since T is complete and 6 is consistent with T, we must 
have A — Faxo---da,0. If As - O(G,...,G,_) then ao,..., ap will realize [ in 
2. 


Our goal is to obtain the converse of the preceding proposition. We will not 
need the assumption that T is complete, but we will need to assume that our 
language is countable. The fact that we do not know any good criterion for a 
theory in an uncountable language to have a model omitting a type is one of the 
primary difficulties in a satisfactory development of the model theory of uncountable 
languages, and also accounts for many of the problems in dealing with uncountable 
models even for countable languages. 


DEFINITION 1.3. T locally omits ['(2o,...,@n) iff T does not locally realize 
I, that is, iff for every formula 0(xo,...,%n) consistent with T there is some 
(ao0,---,2n) € T such that (@ A 7¢) is consistent with T. 


The theorem we are after is the following fundamental result: 


THEOREM 1.6. (Omitting Types Theorem) Assume Lis countable. Assume 
that T is consistent and that T locally omits T(ao,...,2%n). Then T has a countable 
model which omits T. 


Our proof of this result will be via the Henkin method and follow closely the 
proof of the completeness theorem. That is, we will expand our language by adding 
new individual constant symbols and expand T to a Henkin Set in the new language. 
This expansion will be done in such a way that the canonical model of the resulting 
Henkin set will omit the type IL. 

One point needs to be considered first—the definitions of locally omit and locally 
realize depend on the language that the formula 6(20,...,2n) is allowed to come 
from. Just because T locally omits T in £ (i.e. considering just 6’s in £) is no 
guarantee that it still does so in a larger language L’ (i.e. allowing 0’s in £’). The 
following lemma says that we can add constants to our language without harming 
things, and furthermore we can add finitely many axioms (in the new language) to 
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T and still have [ be locally omitted. This is the essential fact needed to build up 
our Henkin Set. 


LEMMA 1.7. Assume that T locally omits T(xo,...,2%n). Let L' D L add only 
individual constant symbols to £L. Let X’ = TU {vy,..., Wx} be consistent, where 
wi € Sng. Then X' locally omits T (in L'). 


ProoFr. Let 6’(2,...,%n) of £L’ be consistent with H’. Let co,...,¢m list all 
constants of £’ — £ occurring in any of 0’,y1,..., psix. Let yo,..-,Ym be variables 
not occurring in any of 6’,W1,...,%%. Thus 


X (0, -6-5 Ens Yor---5Ym) = (8 AL A-: A thy) o sped cm 


pene 


is a formula of £, and y is consistent with T. Finally, let 


O(x0,---,2n) = Syo-+: dymx. 
Then 0 is also consistent with T, hence there is some ¢ € T' such that (@A\-¢) is con- 
sistent with T. Therefore (yA—7¢@) is consistent with T, hence x(#o0,..-,2n,Co,---;Cm)A 
¢ is consistent with T, i.e. (@’ Ad, A---Av,% A7¢) is consistent with T, which says 
precisely that (0’ A =@) is consistent with b’. 


Having this lemma we can now proceed to prove the Omitting Types Theorem. 


ProoF. Let £L* = LUC, where Co is a set of new individual constant symbols 
with |Co| = w. Then T is still consistent and still locally omits T in £*. For 
simplicity in notation we assume that I has just one free variable, so T is (x). We 
will extent T to a Henkin Set “* of £* such that the following holds: 

(3) for every constant c of £* there is some $(x) € T such that a¢(c) € d*. 

If 2* is the canonical model of a &* satisfying (3), then for every a € A*, 
a=c™ for some c, hence 2%. / 7¢(@)-i.e. A* omits T(a). 

Let {o,,| 2 € w} be a listing of all the sentences of £*, and let {c,| n € w} list 
all constant symbols of £*. We define recursively a set i, of sentences of £* such 
that “o = T and for each n € w we have: 

On) Hy, C ae 11 and in41 is consistent, 

n) Un+1 — Un is finite, 

In) on € Sagi Or AOn € En yt, 

2n) if o, € Un41 and o, = Vx¢(x) for some (x), then =¢(c) € Hy+1 for some 


1 
2 


( 
( 
( 
( 


(8n) 70(cen) € En4i for some $(x) ET. 

We pass from %,, to U,z+41 in three steps: first we obtain Untt so that on € 
X,41 OF 77, € U,41; then we obtain X,,,s such that +¢(c) € U,,,3 for some c, 
provided o, = Vad(x) ¢ U,41-these are just as in the proof of the Completeness 
Theorem. 

Given %,,, aa finite, consistent extension of T—we know by the lemma that 
X42 locally omits T(x), in £*. Look at the consistent formula 0(7) = x = Cp. 
There must be some ¢(x) in I such that « = c, \ 74(x) is consistent with &,,, a. 
Therefore we define ©,41 = %,,43 U {7¢(cn)}, and this is as desired. 

&* = Unew Un is the desired Henkin set, whose existence completes the proof. 


COROLLARY 1.8. Let T be a complete theory in a countable language. The 
following are equivalent: 
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(i) T has a model omitting T(a1,...,%n), 

(ii) T has a countable model omitting T, 

(itt) T locally omits T. 

Even if T is not complete, (i) is equivalent to (ii), but (ii)=(iii) fails. 

The most natural examples of types consistent with a theory T are the sets of 


formulas satisfied by a fixed tuple of elements in a fixed model of JT. These types 
are also “maximal.” 


DEFINITION 1.4. (1) Given a model 2 and ao,...,@, € A, the (complete) type 
of ao,..-,@n € A is 
tpa (ao, oe An) = {0(Zo, oN ,2n)| Qa [— o(Go, ate ,Gn)}. 
(2) A type ['(%,.-.,%n) is a complete type (of T) iff T’ is realized on some model 
(of T) and for every ¢(20,..-,2n) either 6 €T or =@ ET. 


LEMMA 1.9. ['(ao,...,2n) is a complete type of T iff T = tpg(ao,-.-,@n) for 
some AE T and ag,...,Qn € A. 


A consequence of the realizing types theorem is the following: 


CorROLLARyY 1.10. Jf T'(ao,...,@n) ts consistent with T then there is some 
complete type [*(ao,...,%n) of T with Tl CI™*. 


We will sometimes say that I'(xo,...,%n) generates a complete type of T to 
mean that I is contained in exactly one complete type I'*(ao,...,2n) of T-i.e. 
whenever 2,8 — T, ao,...,@y realize [ in 2, and bo,...,b, realize [ in B then 
tpy(Go,---;Qn) = tpss(bo,---, bn). 

Just as the sets of sentences one normally writes down do not usually axiomatize 
complete theories, so the types one writes down do not usually generate complete 
types. For example, let T = Th((w, <,+,-,0,1)) and let To(a) = {7 < a2| n € wh. 
Then I is realized on all models of T which are not isomorphic to 3 = (w,< 
,+,:,0,1). The elements of any 2% / JT which realize [9(x) are “infinite”, ie. 
have infinitely many elements of A less than them. But T'o(x) does not generate a 
complete type since it does not decide for example whether or not “D | x” should 
hold (for primes p € w). In fact any way of adding “p | x” or “ap | 2” for any 
primes p € w leads to a consistent type extending ['p(z). 

Let P be the set of prime numbers in w. Let X C P. We define the type 

Px (x) =Po(x)U {p| z| pe X}U {p /a| pe P— X}. 

Then each [x is a consistent type with T. Although [x does not generate a 
complete type, we can say that the same complete type cannot contain both Ix 
and Ty, for instance X # Y, both subsets of P. Thus-there are 2” different 
complete types in x consistent with T. An element of a model realizes exactly one 
complete type, hence a countable model can realize just countably many complete 
types. Since every type consistent with T is realized on some countable model of T 
we can conclude that T has (at least) 2” non-isomorphic countable models. In fact 
an elementary lemma shows that there are at most 2” non-isomorphic countable 
models for any countable language £—just count the number of different models 
with universe = w. We thus have: 


PROPOSITION 1.11. T = Th(M) has precisely 2” non-isomorphic countable 
models. 
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In applications (as in the next chapter) we frequently need to realize or omit 
an infinite number of types simultaneously. The relevant results are as follows: 


THEOREM 1.12. Let T be a complete theory (in a countable £L). Assume each 
type T,(X0,---,Xn,) is consistent with T, k €w. Then T has a (countable) model 
realizing every Tp, k €w. 


THEOREM 1.13. (Extended Omitting Types Theorem) Let T be a theory 
in a countable £L. Assume that T locally omits each type Tx(ao,.--,@n,). Then T 
has a countable model which omits every Ty, k € w. 


2. Elementary Extensions and Chains 


To discuss how two different models of a theory “compare” on frequently wants 
to talk about one model being included in another. We have two notions of inclu- 
sion, one of which is the straightforward analogue of a subalgebra (as subgroup). 


DEFINITION 2.1. 2 is a submodel of B (or % is an extension of 2), written 
21 C B, iff A C B and the interpretations in 2 of the nonlogical symbols are just 
the restrictions to A of their interpretations in 8, that is: 

c* = c® for any constant of L; 

F*(a1,...,@m) = F?(a1,...,@m) for all aj,...,@m € A and every function 
symbol F' of £; 

R*(ay,...,@m) holds iff R®(a1,...,@m) holds, for all a,,...,am € A and every 


relation symbol R of L. 


A submodel of 8 is uniquely determined by its universe, and there is a simple 
test for checking whether a subset of B is the universe of a submodel. 


LEMMA 2.1. (1) Tf ly, As Cc BS, A, = Ag then A, = Ay. (2) X C B is the 
universe of a submodel of 8 iff X 4 0, c® € X for all constants c of L, and 
F®(a,,...,@m) € X for all ay,...,am € X and every function symbol F of L. 


EXAMPLE 2.1. Here are examples of chains of submodels: 


(1) ({0, 3}, {(0, 9), (0, 3), (3,3)}) C @, S) C (ZS) C(Q),S) C(R,S) 
(2) (w,+) € (Z,+) 


Clearly as the above examples can be used to show, 2% C 8, B — T need not 
imply 2 | 7. A more serious problem from our point of view is shown by the 
following example: 


EXAMPLE 2.2. Let 8 = (w, <), 2 = (A, <) where A = {2k +1| k €@w}. Then 
2 C B by definition. Further 2% = B (since 2% = B). If we let do(a) be Vy(a < y), 
which defines the first element in the ordering, then Bg F ¢0(0) A 7@o(1). Now 
0 ¢ A and in fact A4 — do(1). That is: the element 1 which belongs to both 
universes, satisfies a formula in one model which it does not satisfy in the other 
model. This is an undersirable property, and our second concept of extension is 
designed precisely to avoid it. 


DEFINITION 2.2. 2 is an elementary submodel of S$ (% is an elementary ex- 
tension of 21), written 2 ~< B, iff 2% C B and for every formula $(29,...,%p) of 
Land every ao,...,@, € A we have %4 - O(G,.-.,G) iff Ba —E O(G,...,G). 
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Clearly, 2 <~ 8B > 2 = % but the above example shows that 2 = B, AC B 
does not imply that 2 < S. 
Combining submodel with isomorphism yields two hybrid notions: 


DEFINITION 2.3. (1)2 &C B ( is embeddable in 8) means 2 & A’ C B for 
some 2’. (2) 2A =~ B (2A is elementarily embeddable in B) means 2 ~ 2’ ~ B for 
some 2". 


The next lemma enables us to “reverse the order” of isomorphism and inclusion 
in the previous definitions. It is important because in trying to show that 2 has 
an elementary extension with a certain property, our methods yield a model into 
which 2 can be elementarily embedded first of all. 


LEMMA 2.2. (1) A =C B iff A C B’ S B for some B’. (2) A SK B iff 
A ~< Bi ~ B for some B’. 


ProoF. From right to left is clear, so assume h : 2 & 2’ where 2’ C B. Let Bo 
be such that AN Bo = 9 and |Bo| = |B—A’|, say ho : Bo = B—A’. Let B’ = AUBo 
and interpret the symbols of £on B’ so that g = hU ho is an isomorphism of 8’ 
onto 8. Since h is an isomorphism of 2 onto 2’, we will automatically have 2 C 8’ 
(21 < %’ for (2)). 


Extensions and elementary extensions of 2 can be characterized by sets of 
sentences of L(A). 


DEFINITION 2.4. (1) A basic formula is a formula which is either atomic or the 
negation of an atomic formula. (2) The (basic) diagram of 2 is the set 


Ag = {| 6 is a basic sentence of £(A) s.t. Xa — G}. 


LEMMA 2.3. (1) XC B iff Ba — Ag. (2) A ~ B iff Bs = Ay, te. Ba E 
Th(a). 


The lemma is immediate from the definitions. Th(2l4) is sometimes called the 
elementary diagram of 2. 

Of course, Ay and Th(2l4) can have models in which the constant @ is not 
interpreted as the element a of A. Tese other models turn out to be precisely the 
models into which 2 can be embedded or elementarily embedded. In the first case 
this is quite clear. 


LEMMA 2.4. 8 (an £L-structure ) can be expanded to a model B* of Ag iff Bt 
is embeddable in B. 


Proor. If h : & = 2’ where 2’ C B then defining B* for L(A) by setting 
a>” = h(a) yields a model of Ay. Conversely, if 8* / Ay then defining h(a) = 
ae" yields an isomorphism of 2 onto the submodel 2’ C 8 whose universe is 
Al = {a®"}. 


The corresponding result for elementary embedding is less obvious, because 
the requirement that 2l’ < B® involves satisfiability in 2’ rather than just B. It is 
convenient to have a lemma which tells us when a submodel of % is an elementary 
submodel which refers only to satisfiability in BS. 


LEMMA 2.5. Let AC B. Then A ~< B iff for every O(a1,...,Un,y) of Land 
all a1,...,dn € A, if Ba = Syd(Gq,...,Gn,y) then there is some b € A such that 
Ba & O(G,...,G, 6). 
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Proor. We show by induction on formulas w(a1,...,@,) of Cthat for all 
Q1,--+,0n € AA, | W(H,...,G) iff Ba | V(G,...,G). All steps are clear 
except for the quantifiers. So assume ¢(a1,...,2n,y) satisfies the inductive hy- 


pothesis and consider W(a1,.--,an) = dyd. If Aq v(@, ...;@m) then %4 - 
(@,...,@n, 6) for some b € A, hence B4 — 6(G@&,...,Gn,b) andso By F W(G,... 
Conversely, if %4 E w(@q,...,@,) then the hypothesis of the lemma yields B 4 


$(@7,...,@p,b) for some b € A, so A, K O(G,...,Gn, 6) and Aa - W(GH,...,Gp). 


Gn). 


With the lemma it is now easy to prove the characterization of the models into 
which 2 is elementarily embeddable. 


PROPOSITION 2.6. 2 is elementarily embeddable in 8 iff B can be expanded to 
a model 8* of Th(2,). 


PRooF. We need just to prove the direction from right to left. If 8* - 
Th(24) then we define, as before, 2’ to be the submodel of 8 with universe 
A’ = {a®"| a € A}. Then & & A’, so it remains to show 2’ < %B, for which 
we use the lemma. Given $(21,...,2%n,y) of Land aj,...,a/, € A’, suppose Ba — 
Jyo(21,...,¢n,y). Well, a, = a2" for a; € A, so B* K Ayd(Gq,...,Gn,y). There- 


fore X4 F dyd(G@,...,Gn,y), so Xa — O(@,...,Gq,b) for some b € A. Therefore 


B* LK o(aq,...,0,5), ie. Bar K o(al,...,a4,0) where b' =” Al. 


y] n? 


In this proof, the models 8* and % 4, are different, since they are for different 
languages, but the correspondence @ ¢> (@" ) sets up a “translation” between them. 

Intuitively, an elementary extension of 2 is just like 21 but larger, that is, has 
more elements and (perhaps) more types of elements. The following is easy from 
the definition. 


LEMMA 2.7. Assume 2 ~< 8. Then % realizes every type realized in A. 


ProoF. If ao,...,@n € A realize [(ao,...,2%n) in 2 then they also realize 
T(ao,---,;%n) in B since A ~ B. 


More importantly, we can now realize types in elementary extensions of given 
models. 


THEOREM 2.8. Assume I'(ao,...,2%n) is consistent with Th(2l). Then there is 
some 8 such that A ~< B and B realizes T. 

Proor. Let £* = L(A) U {co,..., cn}, where co,...,Cn are constants not in 
L(A). Let 


Y= Th(2a) U {h(co,---,en)| 6 € TH. 
If B* —E ¥ then 2 is elementarily embeddable in 8’ = $* [ £ which realizes T. By 
Lemma 4.3.2, we obtain % such that 2 < 8% and B & 8’, so B also realizes I’. To 
show © has a model we use compactness. If “9 C & is finite, then 


Xo C Tha) U{P(Co,---+€n)| 6 € Tot 


for some finite 9 C I’. The hypothesis yields a§,..., a7, € A such that (24, a),...,a7) 
Me. 


Elementary embeddings give us a precise way to say that (w,<) is the smallest 
model of its complete theory. 


3. THE BACK-AND-FORTH METHOD 69 


PROPOSITION 2.9. Let T = Th((w,<)) and let 8 ET. Then (w,<) =~ B. 


PRooF. For each n € w there is a formula ¢,(x) which defines {n} in (w, <). 


Therefore T — IJr¢,(x) for each n € w, and since B® — T we must have Bp 
én(bn) for bn € B, for each n € w. Let B* = (B,bn)ncw be the expansion of B 
to £(w) in which 7 is interpreted by by. We claim that 8* — Th((w,<)). Let 
wW(xo,---,L~) of Lbe given and suppose (w, <),, FE w(%,.-., 7%). Then, since ¢,,’s 
define {n}, we actually have 


(w, <) = Vito °° -Vrr[Ono (Lo) A+++ A Ony (k) = y]. 


Therefore this sentence is also true in 8, whence it follows that B* — w(7,..., 7). 
Proposition 4.3.6 thus yields the conclusion. 


Also, of course, it follows from this that a type realized on (w, <) is realized in 
every B = (w, <). Further, if B = (w,<) but B F (w, <) then S$ must realize the 
type {7¢n(x)| n € w}, which is omitted in (w,<). Thus, (w,<) is the only model 
of T which realizes only the types which T locally realizes. 

We shall also consider chains under inclusion. 


DEFINITION 2.5. Let I be linearly ordered by <. A family {2l;};¢7 is a chain 
iff 2; C WM; whenever 7,7 € I,7 <j. {Ai}ier is an elementary chain iff 2; < 2, 
whenever 1,7 € I,i <j. 

DEFINITION 2.6. Let {2;};e7 be a chain. The union of the chain {2;};e7 is the 
model % defined by B = Ue, Ai, CPS CE (Og; e3 Gg) SPO Ga 
@1,...,Gm € Aj, and R®(a1,...,@m) holds iff R™(a1,...,@m) holds if a1,...,@m € 
Aj. We write 8 = User 2A; 

LEMMA 2.10. The union ® of the chain {2;};c7 is well defined, and A; C B 
for everyie€ I. 


THEOREM 2.11. Let {2;}icr be an elementary chain and 8 = j,-,%i- Then 
A; < B for everyie I. 

PROOF. We prove, by induction on ¢(x,...,%n), that for every i € I and 
for every do,..-,@n € A; we have (2;)4, F O(Go,...,Gn) iff Ba, E O(Go,...,G). 


The only case of difficulty occurs when $(#) = Jyv(Z,y). Let ag,...,dn € A; 
and suppose By,  $(G,...,Gn). Then Bg - W(a,...,Gn,b) for some b € B- 
but then 6 € A; for some j € I, i < j. So by inductive hypothesis we know 
(Aj), F O(Go,-..,@n) as desired. 


We will use this theorem to construct a model in an infinite number of steps, 
each “approximating” the result. 


3. The Back-and-Forth Method 


If two models are isomorphic, how might one prove this? In this section, we 
explain a general method which we will use, both here and in the next chapter, to 
show models are isomorphic. Although this method has extensions to uncountable 
models, we will restrict our attention to just countable models. 

To motivate the method, consider two countable models 2,8. Then we can 
list the elements of both their universes, say A = {a,,|n € w} and B = {b,| n € w}. 
To define an isomorphism, say h, of 21 onto S we need to specify for each n € w 
an element d,, € B such that h(a,) = d,. We must also specify for each n € w an 
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element c,, € A such that h(c,) = b,. [Of course, each d,is some bp and each cy 
is some a, but it is notationally easier to introduce new designations c,,d;,.] We 
want to do this in such a way that the resulting h is an isomorphism, that is for 
every atomic formula $(a@o,...,%n) and ag,...,a),E A 


Ws d(ah,..-,a/,) iff Be E d(h(ah),...,h(an)’). 


Our procedure is to do this recursively—that is, knowing do, ldots,dy,— 1 and co,...,Cn—1 
we will define d, and c,. The problem is what inductive hypothesis on do,...,dn—1 
and ¢g,---,€n—1 will enable us to choose appropriate d,, and c,,? At the very least 
we will need to know that the choices already made will make the piece of h al- 
ready defined behave like an isomorphism-i.e. for any atomic a(ao,...,2,%) and 
a,--+,4, € {do,---,Gn—1,C0,+--;Cn—1} we have A, — Cee if Be E 
a(h(ao),..-,h(a},)). This may not be enough, however. In general we need to have 
some notion of “similarity” between tuples from A and tuples from B which does 
guarantee that we can continue to build up our isomorphism. 


THEOREM 3.1. Let 2 and 8 be countable. Assume there is some relation ~ 
between n-tuples of elements of A and n-tuples of elements of B, for all n, such 
that: Vag,...,@,-, € A Vbg,...,5,_-1, EB 


1) (ap,.--,@,,_1) ~ (bp, --, 00 _,) = for all atomic a(xo,...,2p—-1) of L, AA - 
ae 0 n—-1 Oors to’ n—-1 
a(ap,---,%,_1) tf Be - a(bp,..., 0,4). 
2) (ap,---,@,-1) ~ (09,---, 0,1) = Val, € Addl, € B such that (ag,...,a),) ~ 


(bp,-.-,01,) and Vb), € Baal, € A such that (ap,...,a),) ~ (b,---,0,)- 
HO~O. 
Then A = B, and in fact whenever (ag,...,a),_1) ~ (bp,---, 01,1) there is 


some isomorphism h of 1 onto B with h(a}) = bv; for alli =0,...,n—1. 


ProoF. Let A= {a,| n € w}, B = {b,| n € w}. We define, by recursion on n, 
elements d, € B, cn € A such that for each n € w we have 


(ao, Co, @1, C1, Ras plasty) oe (do, bo, di, b1, oa pda Cri)' 


This is clear, using condition 3) to start and condition 2) to continue. By condition 


1) the resulting map A is an isomorphism. Given to begin with that (aj,...,a},_1) ~ 
(bp,.--,0/,_1) we could have started with that rather than @ ~ @ to obtain an 
isomorphism taking aj, to bi for each i =0,...,n—1. 


The trick in using this result is in coming up with a relation ~ of similarity 
which has properties 1)-3). 


EXAMPLE 3.1. LY = {<}, © = the axioms for dense linear order without 
endpoints. Let 2,8 — . Define (ao,...,@n) ~ (bo,---,0n) iff (a; <* a; & b; <® 
b; alli,j7 = 0,...,n). Then 1)-3) of the theorem are satisfied, therefore 2 = B 
provided |A| = w = |B|. 

(Even in this example, one can find 2,8 —/ © such that |A| = |B] 4 w but 
2A% B.) Note that this example shows that T = C'n(X), “the theory of dense linear 
order without endpoints,” is w-categorical, hence complete by the Los-Vaught test. 
It follows, for example, that (Q, <) = (R, <). 
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4. Exercises 


Let T be a complete theory of £, and let (a) and U(x) be complete L-types 
consistent with T. Assume that every element of every model of T' realizes 
either ® or V. Prove that both ® and W contain complete formulas. 

Let £ be a countable language containing (at least) the binary relation symbol 

E. Let T be a consistent theory of £ such that for every 2] T, E™ is an 

equivalence relation on A which has at least one infinite equivalence class. Prove 

that there is some y(x) € Fmz, consistent with T, such that for every A —- T 

every a € A which satisfies y belongs to an infinite E™-equivalence class. 

Prove or disprove: 

(a) (Q,<) has a proper elementary submodel. 

(b) (Z,<) has a proper elementary submodel. 

(a) Let T be a complete theory of £, and let ®(x) and U(y) be types each 
of which is realized on some model of T’. Prove that T’ has some models 
realizing both ® and W. 

(b) Give an example to show that the result in part (a) can fail if the theory 
T is not complete. 

Let 2% = (Q,+,-,0,1). Prove that T = Th(Q) is not w-categorical. 

Let ®(x) be a type consistent with the complete theory T but which is realized 

by at most one element in every model of T’. Prove that there is some formula 

(a) consistent with T such that 


T E Va(w(a) > y(x)) for all gy € ©. 


Let T’ be a complete theory of £, and let 21 and 8 be models of T. Prove that 

there is some € — T such that both 2( and S$ can be elementarily embedded in 

€. 

(a) Assume that 2 < 8 and 24 B. Prove that there is no formula v(x) of £ 
such that y® = A. 

(b) Give an example of 2,8 and v(x) where A C B, AA B, but y® = A. 

Let T = Th((Z, <)). Let (a, y) = (@ < y) A7d2z(a@ < zAz < y). Prove that 

w(a,y) is a complete formula with respect to T. 


CHAPTER 5 


Countable Models of Complete Theories 


0. Introduction 


In this chapter we give detailed attention to the question of what the collection 
of countable models of a complete theory T can look like. This includes consid- 
eration of how many countable models (up to isomorphism) T can have, and also 
how the models are related to one another, particularly by elementary embedding. 
We are also interested in how the properties of the collection of countable models 
of T are related to syntactical properties of T. Some particular questions we will 
consider (and answer) are the following: 

(1) What cardinals « can be the number of non-isomorphic countable models 
of some T’? We have seen examples where & = 1,w,2”, and we know k& < 2”, so this 
comes down to: (a) there are complete theories T such that T has exactly n non- 
isomorphic countable models, for 2 <n € w? (b) ifw<k < 2” is there a T with 
exactly « non-isomorphic countable models? The question in (b) is complicated by 
the fact that even the existence of such a cardinal « is independent of the axioms 
of set theory. Although we will discuss this later we will not be able to solve this 
question. 

2) Characterize (syntactically) the theories T with exactly one countable model, 
i.e., which are w-categorical. 

3) Does 2 ~~ B and B ~~ A imply that 2A B, for countable A,B? 

4) If 2, are both countable models of T must we have either 2 =< 8 or 
B =~ A? 

Positive answers to both (3) and (4) would say that ~~ is a linear order on 
isomorphism types of countable models of T. 

(5) Does every T’ have a smallest model, i.e. a model 2 such that 2 =< B 
for every models 8 of T? Must such an 2 be unique? (Recall that previously we 
showed that (w,<) was such a model of T = Th((w, <))). 

(6) Does every T have a largest countable model, 8%, i.e., a countable model 8 
such that 2 =~< 8 for every countable model 2 of T? Must such a 8 be unique 
(up to isomorphism)? 

In each of (5), (6) if the answer is no, one would want to characterize the 
theories with such models. 

(7) If 21 = B are countable and realize the same types must 2 = 8? 

Similarly, we know that 2 =< % implies that % realizes every type that 2 
does. 

(8) Does the converse hold, for countable 2, 8? 


1. Prime Models 
Throughout, T is a complete theory in a countable language. 
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DEFINITION 1.1. A model 2 of T is prime iff for every B —E T A =~x B. 


In this section we will characterize prime models, characterize the complete 
theories which have prime models, and show the uniqueness of the prime model of 
a theory. 

Suppose that 2 is a prime model of T. Then, obviously, 2( is countable. Further, 
a type realized in 2 is realized in every model of T’ and thus must be locally realized 
in T (by the omitting types theorem). In fact we will see that these two properties 
characterize prime models. 

Let I'(xo,...,%n) be a complete type of T. If T’ is locally realized in T then 


there is a formula a(2o,..., 2%) consistent with T such that 
T E Va0--:Vin(a > ¢) 
for every $(20,.--,%n) € T. Since T is complete we must have 


a(ao,.--,@n) ET. 
A complete type generated (with respect to T) by a single formula is called princi- 
pal, and the generating formula is a complete as in the following definition. 


DEFINITION 1.2. A formula a(%o,...,%n) is a complete formula (or atom) of 
T iff T & dvo---dvna, and for every $(xo0,...,%n) either T E (a > ¢) or TE 
(a > 9). 


DEFINITION 1.3. A model 2 of T is atomic iff for every ag,...,@, € A there is 
some complete formula a(x9,...,%p) of T such that 
Ma - a(G,...,Gn). 


We have essentially shown the easy half of the following characterization theo- 
rem. 


THEOREM 1.1. Let AE T. Then is prime iff 2 is countable and atomic. 


PrRooF. (=) Assume 2 is prime and let ao,...,@, € A. Then tpy(ao,..., Qn) 
is a complete type realized in 2, hence by the above argument it contains a complete 
formula. Therefore 21 is atomic. 

(<=) Let 2 be countable and atomic. Let 8 be any model of T. We must show 
that 21 =~< %. We will do this by a variation on the back-and-forth method. Let 
A= {an| n € w}. We will define, by recursion on n, elements 6, € B such that for 
each n we have 

(Ql, ao,... , On) = (B, bo,..- £ Dis) 
That is: (8, bo,---,0n) F TR(yao,....a,}) in the language LU {G,...,Gn} where 
G@ is interpreted as b; € B. It will then follow that B* = (B,by)ne, is a model of 
Th(2,4), hence 2 ~~ B as desired. 

So, we first pick bo. Since 2 is atomic, a9 must satisfy some complete formula 
ao(xo). Then 8g F ao(bo) for some bo € B. Since ap is complete, we have 
T - Vx0(a0(ax0) + ¢(x0)), for every ¢ € tpy(ao). Therefore Bg / ¢(bo) whenever 
Bs, E d(G), so (A,ao) = (B,bo) as desired. Now, given bo,...,d, such that 


(21, ao,-.-, An) = (B, bo,...,bn) we show how to choose b,41. Let @n(Xo0,..-,2n) be 
a complete formula satisfied by ao,...,@n and let @n+41(Xo0,---,;£n41) be a complete 
formula satisfieed by ao,...,@n41. Then we must have 


T = Vx +08 Van(Qn = Fan41Qn41) 
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since 4a@n410n+1 € tpa(ao,...,@n). By our inductive hypothesis we must have 


Be KE an(bo,...,bn), so there is some bn41 € B with BB KE an41(bo,..-,bn41). 
Then as before we see that 


(QL, Q0,--+5 Qn+1) = (B, bo, seey bn+1) 
which completes the proof. 


For each n = 1,2,... define 


Tn (@o,---,%n—-1) = {7a(Xo,.--,%n—1)| @ is an atom}. 


Then 2 is atomic iff for each n 2% omits [,,. Thus, by the preceding theorem, T 
has a prime model iff it has a countable model omitting each T,,. By the extended 
omitting types theorem, this happens iff T locally omits each [,. 


DEFINITION 1.4. T is atomic (or atomistic) iff for every formula 4(xo,...,2n) 
consistent with T there is some complete formula a(xo,...,%n) such that (a A @) 
is consistent with T. 


We leave the reader to check that T is atomistic iff T locally omits each Tp. 
The preceding discussion then shows: 


THEOREM 1.2. T has a prime model iff T 1s atomic. 


It is not immediately clear how restrictive the condition that T is atomic re- 
ally is. There are theories that are not atomic, but most examples are specially 
contrived-Th((Z,+)) is a natural example of a theory without a prime model, but 
the proof uses a lot of information about this theory. 

The next result gives a large class of atomic theories. 


THEOREM 1.3. Assume there are just countably many different complete types 
consistent with T. Then T has a prime model. 


PRoor. For each n € w, let 2 (xo,..., Ln) list all non-principal complete types 
in %,...,%p consistent with T. Then each ®% is locally omitted in T’. So, by the 
extended omitting types theorem, T' has a countable model omitting every ®7; this 
model must then be atomic, hence prime. 


THEOREM 1.4. If 2,8 are both prime models of T then X= B. 


2. Universal and Saturated Models 


DEFINITION 2.1. A model 2 of Tis countably universal iff 2( is countable and 
whenever % is a countable model of T then B =~ 2. 


The following lemma is clear: 


LEMMA 2.1. (1) If & is a countably universal model of T then realizes every 
type consistent with T. 

(2) If T has a countably universal model then there are just countably many 
complete types consistent with T. 


The obvious conjecture is that the converses of both parts of the lemma hold. 
First note that we do have the following. 


PROPOSITION 2.2. If there are just countably many complete types consistent 
with T then T has a countable model realizing every type consistent with T. 
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Thus all that remains is the converse to part (1) of the lemma. In analogy with 
the proof that countable atomic models are prime, we would proceed as follows: let 
Ql (be countable and) realize every type consistent with T. Let 8 be an arbitrary 
countable model of T, say B = {b,| n € w}. We try to define, by recursion on n, 
elements a, € A such that for every n we have 


(B, bo, ..-,0n) = (A, ao,.-., Gn). 


For n = 0 let To(xo) = tpas(bg). By hypothesis To is realized in 21, say by ag, so 
we have (%, bo) = (2, ao). Now for n = 1 let Ty (x0, 21) = tps (bo, b1). Once again 
there are ag,a, € A realizing ['1(%o, 21), hence such that (B, bp, 64) = (2, ag, a4). 
But we have already chosen ag so we need an a, such that (ao,a1) realize T). 
There is no obvious reason why such an a; should exist. We are thus forced into 
the formulating of the following stronger property: 


DEFINITION 2.2. 2 is w-saturated iff for every n € w and all ao,...,@n—1 € A, 
the structure (2, a9,...,@n—1) [for the language L(ag,...,dn) = LU{%,...,Gr—1] 
realizes every L(ao,...,@n,)-type ['(y) consistent with it. 


If 2( is countable and w-saturated, we may also say that 2l is countably saturated. 
The above argument can be carried out for w-saturated models, so we obtain the 
following theorem. 


THEOREM 2.3. Assume 2 is an w-saturated model of T. Then every countable 
model of T can be elementarily embedded in 2. Therefore 2 is countably universal 
provided A is countable. 


We thus have (i)=(ii)=>(iii), where: 
(i) 2 is countably saturated 
(ii) 2 is countably universal 
(iii) lis countable and realizes all types consistent with T. 
Do either, or both, of these arrows reverse? We cannot answer these questions 
at the moment, but we can show that the existence of a countable w-saturated 
model is equivalent to the existence of the other two types of models. 


THEOREM 2.4. The following are equivalent: 
(i) T has a countable w-saturated model 
(it) T has a countable universal model 
(iti) There are just countably many complete types consistent with T. 


PRooF. It suffices to show (iii)=(i). Assume that (iii) holds. Then in fact, we 
have the following: 

({) for every 2 - T, every n € w and every dop,...,@,_1 € A there are just 
countably many different complete 1-types T'(y) of £U {G,...,@,—1} consistent 
with Th((Ql, ao,..-,@n—1)). 

The reason for this is that every such I'(y) is ®(@,...,@n—1, y) for some com- 
plete L-type ®(ro,...,%n-1,y) consistent with Th(2l) = T, and (iii) asserts there 
are just countably many such ®. 

(t) for every countable 2 / T there is some countable 2’ such that 2 ~< 2’ 
and for every n € w and every ao,...,dn € A, YM’ realizes every type I'(y) of 
LU {G,..-,G,} consistent with Th(2, ao,..., Gn). 

This holds by applying Theorem 4.2.8 to the theory T* = Th(24) where the 
T;,’s list the complete 1-types in just finitely many new constants-—the list being 
countable since A is countable and by (f). 
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We now can show T has a countable w-saturated model. Let 29 be any count- 
able model of T an, for every n € w, let A%,41 = (Ay)! be the model given by 
({) starting with A,. Thus {2%,}new is an elementary chain of countable models. 
Let % = Unew %n- Then 2 is a countable model of T, which can be shown to be 
w-saturated, since every finite subset of A is a finite subset of some A, hence the 
types over that finite subset are all realized in 2,,41, so in particular in 2. 


If T has uncountably many complete types, then we no longer get countable 
w-saturated models but similar arguments show: 


PROPOSITION 2.5. T has w-saturated models of power « for every K > 2”. 


DEFINITION 2.3. 2 is w-homogeneous iff for every n € w and every do,...,@n 
and bg,..-,0, € A 


if (Ql, ao,-.-, Qn) = (2, bo,..., bn) then (2, ao,...,an) & (A, bo,.--, On). 


THEOREM 2.6. A countable model is w-saturated iff it is both w-homogeneous 
and universal. 


3. Theories with Just Finitely Many Countable Models 


DEFINITION 3.1. A model 2 of T is countably universal iff 2( is countable and 
whenever % is a countable model of T then B =~ 2. 


The following lemma is clear: 


LEMMA 3.1. (1) If & is a countably universal model of T then realizes every 
type consistent with T. 

(2) If T has a countably universal model then there are just countably many 
complete types consistent with T. 


The obvious conjecture is that the converses of both parts of the lemma hold. 
First note that we do have the following. 


PROPOSITION 3.2. If there are just countably many complete types consistent 
with T then T has a countable model realizing every type consistent with T. 


Thus all that remains is the converse to part (1) of the lemma. In analogy with 
the proof that countable atomic models are prime, we would proceed as follows: let 
2 (be countable and) realize every type consistent with T. Let 8 be an arbitrary 
countable model of T, say B = {b,| n € w}. We try to define, by recursion on n, 
elements a, € A such that for every n we have 


(B, bo,..-,0n) = (A, ao,.-., Gn). 


For n = 0 let To(xo0) = tps(bo). By hypothesis [9 is realized in 21, say by ao, so 
we have (%, bo) = (2,ao). Now for n = 1 let Ty (ag, 71) = tpss (bo, 61). Once again 
there are ag,a, € A realizing ['1(%9,21), hence such that (B, bj, 6) = (2, ag, a4). 
But we have already chosen ag so we need an a, such that (ao,a1) realize Ty. 
There is no obvious reason why such an a, should exist. We are thus forced into 
the formulating of the following stronger property: 


DEFINITION 3.2. 2 is w-saturated iff for every n € w and all ao,...,@n_1 € A, 
the structure (2, a9,...,@n—1) [for the language L(ag,...,dn) = LU{%,...,Tr—1] 
realizes every £L(ao,...,@n,)-type ['(y) consistent with it. 
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If 2 is countable and w-saturated, we may also say that 2lis countably saturated. 
The above argument can be carried out for w-saturated models, so we obtain the 
following theorem. 


THEOREM 3.3. Assume 2 is an w-saturated model of T. Then every countable 
model of T can be elementarily embedded in 2. Therefore 2 is countably universal 
provided A is countable. 


We thus have (i)=(ii)=>(iii), where: 
(i) 2 is countably saturated 
(ii) 2 is countably universal 
(iii) lis countable and realizes all types consistent with T. 
Do either, or both, of these arrows reverse? We cannot answer these questions 
at the moment, but we can show that the existence of a countable w-saturated 
model is equivalent to the existence of the other two types of models. 


THEOREM 3.4. The following are equivalent: 
(i) T has a countable w-saturated model 
(it) T has a countable universal model 
(iit) There are just countably many complete types consistent with T. 


PRooF. It suffices to show (iii)=(i). Assume that (iii) holds. Then in fact, we 
have the following: 

({) for every 2 - T, every n € w and every ao,...,@,—-1 € A there are just 
countably many different complete 1-types T'(y) of £U {G,...,@,—1} consistent 
with Th((Ql, ao,..-,@n—1)). 

The reason for this is that every such I'(y) is ®(@,...,@,—1, y) for some com- 
plete L-type ®(xo,...,%n-1,y) consistent with Th(2) = T, and (iii) asserts there 
are just countably many such ®. 

({) for every countable 2 | T there is some countable 2’ such that 2 ~< 2’ 
and for every n € w and every ao,...,dn € A, YM’ realizes every type I'(y) of 
LU {G&,..-,G,} consistent with Th(2, ag,..., Gn). 

This holds by applying Theorem 4.2.8 to the theory T* = Th(24) where the 
T;’s list the complete 1-types in just finitely many new constants-—the list being 
countable since A is countable and by (f). 

We now can show T has a countable w-saturated model. Let 29 be any count- 
able model of T an, for every n € w, let An41 = (Ay)’ be the model given by 
({) starting with 2,,. Thus {2,}ne. is an elementary chain of countable models. 
Let 2 = Cees A,. Then 2 is a countable model of 7, which can be shown to be 
w-saturated, since every finite subset of A is a finite subset of some A, hence the 
types over that finite subset are all realized in 2,,41, so in particular in 2. 


If T has uncountably many complete types, then we no longer get countable 
w-saturated models but similar arguments show: 


PROPOSITION 3.5. T has w-saturated models of power « for every K > 2%. 


DEFINITION 3.3. 2l is w-homogeneous iff for every n € w and every do,...,@n 
and bo,...,6n € A 


if (Ql, ao,.--, Qn) = (2, bo,..., bn) then (2, ao,...,an) & (A, bo,..-, On). 


THEOREM 3.6. A countable model is w-saturated iff it is both w-homogeneous 
and universal. 


CHAPTER 6 


Further Topics in Model Theory 


0. Introduction 


In this chapter we cover a number of additional topics needed for further work 
in model theory. Proofs are only given in outline, with the reader left to fill in the 
details. We begin with another application of the Henkin method, to prove Craig’s 
interpolation Theorem, a fundamental property of first-order theories. From it we 
derive the definability result of Beth. In 6.2 we define the «-saturated models, 
for arbitrary «, generalizing the results in 5.2. Skolem functions are introduced in 
6.3 and used to derive stronger forms of the Lo6wenheim-Skolem theorem. We also 
introduce the notion of indiscernible sequence, although our proof of the existence 
of such sequences is even sketchier than usual and will depend on a combinatorial 
theorem which we will not prove. Various applications are briefly discussed in 6.4, 
including preservation theorems, model-completeness and 2-cardinal theorems. We 
also exhibit a finite axiomatization of Th((w, <)) and study the countable models 
of this theory which provide counterexamples to the conjectures of the previous 
chapter. 


1. Interpolation and Definability 


Let £; and £2 be two languages, let £ = £19 £2, and let 01,02 be sentences of 
Ly, £2 respectively. Suppose that - (a1 + a2) — that is, whenever 2 is a structure 
for (£1 UL2) and & F o; then AE og. Now AE oy iff APL E o1, so if we take 
any A’ with W > L, = AT Ly we will still have 2’ — o,, and so A’ & az (despite 
the fact that 2’ [Ly A Wf Le). Similarly, if A” is such that 2” | Lo = A’ f Lo 
then we still have 2” — o2. That is, we have shown the following: if 21 E o; and 
WA’ PL=ATL then A” — og. It is reasonable to expect, then, that the validity 
of the implication depends just on the language £. This is in fact true, and is the 
content of Craig’s well-known theorem. 


THEOREM 1.1. (Interpolation) Let L = Ly N Le, 01 € Sng,,02 € Sng,. 
Suppose that — (0, > 02). Then there is some sentence 6 of £L such that 


E (0, > 0) and FA 02. 


ProoF. First note that we may assume that £, and Ly are both countable 
(since only finitely many symbols can occur in a; and 02). 

We may assume that there is no sentence 0 of £ such that — (a1 > 0) and 
E (@ — o2), or, equivalently, there is no sentence @ of £ such that o, - @ and 
02 — 70. We will show that {01,02} has a model and so /- (01 — 072). 

Let Co be a countably infinite set of individual constant symbols not in £,ULo. 
Let £*, £}, £3 be the results of adding the constants in Cp to £, £1, £2 respectively. 
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We will prove that there is a Henkin set Uf of £j-sentences and a Henkin set 
&5 of £3-sentences such that a, € Ut, -02 € U3 and (f) for every sentence 0 of L*, 
6 € Dt iff 6 e DS. 

Once we have done this, let 2; be an £j-structure which is a canonical model 
for U7, and let 2X2 be an £3-structure which is a canonical model for 43. Then 
At, [L* S Ay [L* by (Tt), say under the isomorphism h. We can then expand 2, to 
an (£7 U £3)-structure 2 so that A is an isomorphism of 2, [£5 onto ty. Therefore 
Ais a model of Nj} U US, in particular of {01,402} as desired. 

Thus we are done once we have defined Henkin sets U7, U5 as above. This we 
do by carrying out the Henkin construction simultaneously starting with ©, = {o1} 
and X2 = {702}, with an added joint condition to guarantee that we end up with 
(t) holding. 

Let Snes = {0}: n € w} and Sn¢x = {62 :n € w}. We define, by recursion on 
n, sets Tf, C Snes and T?, C Sng; starting with [4 = X; (i = 1,2) and such that 
the following hold for all n € w and i = 1, 2: 

0%) T4 CT%, and Tt, \T%, is finite, 

10, CT 1-08 30, e Dis 

27) if OF €Ty,41 and 6), = Vry(x) for some v(x), then -y(c) € T',,, for some 
constant c, 

and the following joint condition 

3n) there is no sentence 6 of £* such that [*,, - @ and T?,, E70. 

Note that (3,,) implies that both [,,, and T?,, are consistent. Further (3,) 
for all n guarantees that ({) holds for 4* = U,,<,,F%,, and thus we are done once 
we show the construction can be carried out. 

First of all, note that our beginning assumption is that there is no sentence 
6 of £ such that T) & @ and [2 — 76. Since I) do not contain constants from 
Co we see that there is no sentence @ of L* such that [4 & 6* and T2 — 76* (by 
generalization on constants). [We refer to this as (3_1)] 

So, given I, finite sets of sentences of L* satisfying (3,1), we wish to define 
I’ ,, satisfying (0',), (11), (2,) and (3,,). We follow the proof of the corresponding 
steps in the Completeness Theorem (pp 50-51) to define in turn Dies eee ety re 
where the requirements that each of these sets is consistent is replaced by the 
stronger requirements leading to (3,,) — that is, that there is no sentence 0 of L* 
such that 


(a) ener _E 6 and [2 - -6, 
(by D235 = Gand. S58, 
(c) Digi 9 and T% 4). 79, and finally (3,) itself. 


Obviously, the hypothesis that (3,,_,) holds is used to start. All the sets are 
then similar to those of the Completeness Theorem, and are left to the reader. 


2 
Phat 


Interpolation also holds for formulas with free variables in place of sentences 
01,02. To see this, replace the preceding Theorem to obtain an interpolating 6’ with 
these new constants, and replace the constants with the variable to get a formula 
@ of £L. Thus: 


THEOREM 1.2. Let £L = £19 Lo, vi(%) € Fmg, (@ = 1,2). Suppose that 
— (yi > 2). Then there is a 0(@) of L such that — (y, > 0) and E (0 > 2). 
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As an application of Craig’s Interpolation Theorem (in the form applying to 
formulas) we derive another classic result, which concerns definability of a relation 
in all models of a theory. 

Let £ be a language, R a predicate symbol of nm arguments no in £. Let 
Ly, = LU{R}, and let T be a theory of £1. We say that Ris (L - ) definable in T 
iff there is a formula y(x,...,@%p—1) of £ such that 

TE Va9-+-Vin—1[Rx9 +++ tn_1 © Y(Z)]. 
(Reference to £ is sometimes suppressed). 

Clearly if R is definable in T then the interpretation of R in any model of T is 
uniquely determined by the interpretations of the other symbols (since it must be 
whatever relation y defines) — more formally, the following must hold: 

(x) if A,B KT and ASL = BL, then R®™ = R® (and so A = 8B). 

The definability theorem of Beth states that the converse also holds. 


THEOREM 1.3. Let £L; = LU{R} where R is an n-ary predicate symbol not in 
L. Then R is L-definable in T iff (x) holds for T, where T is any theory of L. 


PRooF. It suffices to show that (*) implies the definability of R, so let us 
suppose that (*) holds. We wish to formulate (*) in a syntactical manner. Let S$ 
be another n-ary predicate symbol not in £1. Let R’ be the theory of Ly = LU{S} 
obtained from T by replacing R everywhere by S. Then T and T” are “the same”, 
except one uses the symbol R where the other uses S. (*) then precisely says that 

TUT’ EVa[RE © SZ). 
In particular, then 
TUT’ — (Rz > SZ). 
By compactness there are sentences o € T,o’ € T’ such that 
{o,0'} — (RE > $5), 


Le., 
E (0 A RZ) > (o’ > SZ). 

Interpolation applied to y; = (0 A R&) and y2 = (a’ > SZ) yields a formula 
6(%) such that 


E (o A RZ) > 0(®) and - 6(%) > (a’ > SZ). 


Therefore T — (RX —> 0) and T’ — (6 > SZ). Replacing S everywhere by R 
in the last consequence yields T — (6 > RZ). 
Thus we have T — (RX + 0(Z)), and so R is £-definable in T, as desired. 


Beth’s Definability Theorem also applies to functions — simply consider an 
n-ary function as an (n + 1)-ary predicate. The defining formula then defines the 
graph of the function. 

The following easy fact is sometimes useful. 


LEMMA 1.4. Assume R is £L-definable in T. Then every L-structure A which 
is a model of TO Sng can be expanded to an LU{R}-structure A, which is a model 
of T. 


Proor. Simply define R®* = y*, where y is the £-formula defining R in T. 
This works since for any sentence 0 of £; we have T - 6 iff T — 0’, where 6” is an 
£-sentence resulting from @ by replacing R throughout by ¢. 
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Specific, direct applications of Beth’s theorem are hard to come by. The most 
striking applications are of the contrapositive — i.e., from the non-definability of 
R in T to conclude (x) fails. The non-definability might be shown directly or use 
the contrapositive of the above lemm. 


EXAMPLE 1.1. £ = {<}, Li = £U{+}, T = Th((w,<,+)). Then &A = 
(w+ Z,<) ETNSn¢ but 2 cannot be expanded to a model of T, hence there is 
some 8 = (B,<*) = (w,<) such that there are 2 different functions +, and +2 
on B such that (B,<®,+1) = (B,<®, +2) = (w, <,+4). 


EXAMPLE 1.2. Similarly, for some (B, <*,+*) there are 2 different -1,-2 with 
{B, a +%, 1) = (B, a, +%, “2) = (w, S, +, -). 


2. Saturated Models 


This section concerns generalizations of the material in 5.2 to uncountable 
cardinals. For simplicity, we consider only theories in countable languages in this 
section. This restriction is not essential, but without it we would have to make 
reference to the cardinality of the language in our results, and proofs. 

The fundamental definitions are as follows: 


DEFINITION 2.1. (a) Let « >w. A model 2 is «-saturated iff for every X C A, 
if |X| < « then 2x realizes all £(X) types (in one variable) consistent with 
Th(2x). 

(b) Let T be a complete theory and & > w. Mis a K-universal model of T iff AE T 
and whenever 8 | T, |B| < « then 8 =~ A. 


In this terminology, the theorem on pg. 92 states that on w-saturated model 
of T is w;-universal. The generalization we obtain replaces w by any & > w and wy 
by Kt. 


EXAMPLE 2.1. £ has just a unary predicate P, T is the complete theory stating 
that both P and —P are infinite. Then all models of T are w-saturated, but of the e 
models of T' of cardinality w1, only the one in which both P and —P are uncountable 
is w -saturated. 


EXAMPLE 2.2. (R,<) is w-saturated but not w,-saturated. 
We first require a better notation for dealing with models of the form Ay. 


NOTATION 2. Let X = {ae :€ <a} CA. Then (2, ae)ecq is the expansion of 
A to the language LU {ce : € < a}, where ce’s are distinct constants symbols not 
in £. By convention, if in the same context we also have (B, be)ecq this is also a 
structure for the same language, and thus the statements (2, ae)eca = (B, be)eca 
and (2, de)eca = (B, brilecq are unambiguously defined. 


We can now follow the argument in 5.2, although using transfinite recursion, 
to establish the following: 


THEOREM 2.1. Let 2 | T be «-saturated (kK > w). Then & is a K+ -universal 
model of T. 


Proor. Let 8 ET, |B| <«. Then B= {be :€ < K}. We define, by recursion 
on € < &, elements ag € A such that for every a < & we have: 


(B, be)e<o = (21, dg )é<a- 
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Then the map sending be to a¢ (all € < «) will be an elementary embedding of 
SB into 2. 

The base of our induction a = 0, is the statement 8% = 2, which holds since T 
is complete. 

We break the inductive step into two cases — successor and limit ordinals. 

First case: a = 8+1. The inductive hypothesis is that (B, be)ecg = (2, ae)ecp. 
We need to define ag so that (B, be)e<g = (2, a¢)e<g, which we do by picking ag 
to realize in (2, a¢)e<g the complete (£U {ce : € < B})-type of bg in (B, be)ecg. 
This can be done since 2 is «-saturated and |{ag : € < B}| < |B) <B<k. 

Second case: If a is a limit ordinal then our inductive hypothesis is that for 
every 8 < a (B, be)ecg = (A, ae)ecg. It follows that (B, be)eca = (A, de)eca, since 
a sentence of £U {ce : € < 8} can only contain finitely many constants and hence 
really is a sentence of LU {ce : € < 8} for some 8 < a. This completes the limit 
ordinal step. 


This proof is really half of a back-and-forth argument (of length « rather than 
w). Adding the other half of the argument we obtain this uniqueness result, details 
of which are left to the reader. 


THEOREM 2.2. If 2,8 are both k-saturated, |A| = |B| = k and 1 = 8%, then 
SB. 


TERMINOLOGY 1. If 2l is K-saturated when « = |A|, we say that 2 is saturated. 


In 5.2 we characterized the theories having countable saturated models. The 
existence of uncountable saturated models is complicated by set theoretical consid- 
erations. The following lemma is clear from the definition. 


LEMMA 2.3. If T has a «*-saturated model of cardinality K+ then for every 
model A of T and every X C A with |X| <K, there are at most K* complete types 
of L(X) consistent with Th(Alx). 


The problem is that there may be 2" complete types consistent with a theory 
in a language of cardinality «. In this fashion we see, for example: 


COROLLARY 2.4. If Th((Q, <)) has a saturated model of cardinality of w1, then 
22 = Wy. 


We are in even worse shape with singular cardinals since one can easily show, for 
example, that Th((Q, <)) has no saturated model of cardinality , for any singular 
A. Hint to the reader: find a set X C A where |X| = cf(A) < A and a type of £L(X) 
not realized in 2x, for any A= (Q, <),|A] =A. 

The set-theoretical problem pointed out above is the only difficulty with «*- 
saturated models. 


THEOREM 2.5. Every model of cardinality < 2" has a «*-saturated elementary 
extension of cardinality < 2". 


PrRooF. Here too we follow the proof of the corresponding existence result in 
5.2, with additional cardinality arguments. We first show: 

(1) Let |B] < 2%, X C B, |X| < «. Then BS ~< 8%’ for some %’ such that 
|B’| < 2" and By realizes every £(X)-type consistent with Th(Bx) = Th(B). 

First note that |£(X)| < « hence there are at most 2" £(X)-types consistent 
with Th(Sx). By Compactness and the Lowenheim-Skolem Theorem we can find 
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a model %’ such that 8B ~ 8’, BY realizes each of the £(X)-types consisten with 
Th($x), and |B’| < 2" (cf. the Theorem on pg 73), thus establishing (1). Next we 
derive: 

(2) Let |B] < 2". Then there is some 8* such that B ~< %*, |B*| < 2% 
and for every X C B, |X| < k, BX realizes every £(X)-type consistent with 
Th(Sx) = Th(Bx). 

We first establish a necessary fact about cardinality, namely: 

XX CBX] <a} <2" 


This is true since {X : X C B,|X| < «} x “B and |*B| < (2")" = 2". Thus we 
can write {X :X C B,|X|<«}={Xe:€ < 2"}. Finally, we define an elementary 
chain by: Bo = 8, Bey1 = (Be)’, By = Uce, € for limit 1 < 2", where (Be¢)’ is 
the result applying (1) to Be, for the set X¢. One easily establishes by induction 
that |Be| < 2” for all € < 2” (the limit ordinal cases are the only ones requiring 
attention), and so 8* = B«is as desired. 

Finally, let 2 be given with |A| < 2". We define an elementary chain A¢,€ < K*, 
by Mo = MW, Res+1 = (€)*, Ay = Uce, Ae for limit ordinals 7 < «*. Once again, 
it is easy to see that A* = A,,+ has cardinality < 2". If X C A#, X < « then 
X C A¢ for some € < K* because «+ is regular, hence every £L(X)-type consistent 
with Th(2%) is realized in (Qe41)x. Thus A* is «+-saturated. 


The reader should try to find an w,-saturated elementary extension of (R, <) 
of cardinality 2”. 

One more reason for the seductive appeal of the GCH is the following obvious 
consequence of the preceding: 


COROLLARY 2.6. Assume GCH. Then every theory has a saturated model of 
cardinality «* for every K > w. 


Knowing that a theory has «t-universal models is not terribly useful. But 
knowing that it has saturated models is very useful. One can give, for example, a 
simple proof of the Interpolation Theorem of the preceding section assuming the 
existence of saturated models. As the reader is left to check, it suffices to establish 
the following lemma (due to A. Robinson who used it to derive Beth’s Theorem): 


LEMMA 2.7. Let £L=L,0Lo, let T,,T>2 be theories of Ly, £2 r‘espectively, and 
suppose T = (Ti [ZL UT2/L) is consistent. Then T, UT> has a model. 


PROOF. One first shows (using compactness) that there is some complete L- 
theory T* D T such that (T, UT*) and (Tj UT™*) both have models. Assume 
(T UT*) and (Tz UT*) both have saturated models of cardinality k. Let these 
saturated models be 2, and 22 respectively. Then 2, | £ and 22 | £L are also 
saturated and both models of T*, hence 2; [£L = 25 | L by the uniqueness theorem. 
One then defines 2 | (7, UT) UT™*) such that A[L, = %, and Af Ly & Ay as in 
the proof in the preceding section. 


Thus, we have another proof of Interpolation, under the set theoretic assump- 
tion that ht = 2" for some & > w. There are ways of using essentially this same 
proof and avoiding the additional assumption, but we will not pursue them here. 

The spirit in which saturated models were introduced originally involved unique- 
ness less than their acting as a sort of “universal domain” for a theory JT. That 
is, fixing a sufficiently large saturated model of T, one could study the models of 
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T just by looking at elementary submodels of this fixed model. Some arguments 
require the existence of many automorphisms. 


DEFINITION 2.2. 2 is strongly K-homogeneous (kK > w) iff: for every a < « and 
for all ag, az € A for € < a, if (A, de)eca = (A, ae )eca then (A, ag )eca = (A, ag eca- 


DEFINITION 2.3. 2 is strongly K-saturated iff 2 is K-saturated and strongly 
&k-homogeneous. 


As a consequence of the proof of uniqueness we see: 
COROLLARY 2.8. If 2 is saturated and |A| = & then is strongly K-saturated. 


The «t-saturated models of cardinality 2“ we have constructed are not auto- 
matically strongly «*-saturated. We can make them strongly «*-saturated by use 
of the following result in our construction. 


LEMMA 2.9. Assume that (2, a¢)e<a = (A, ag )e<a- Then there is some B such 
that A < B, |B| = |A| and (B, ag)eca = (B, ag)eca- 


We leave the proof of this lemma to the reader with the following hint: define 
an elementary chain 2l,,, 2 € w, by 2p = 2M and 2,1 is an elementary extension of 
A, such that (%n41,@)aedn = (An41,h(@))aea, for some map h with h(ag) = ag 
for all E <a. 

The existence result easily follows: 


THEOREM 2.10. Every model of cardinality < 2" has a strongly «*-saturated 
elementary extension of cardinality < 2”. 


3. Skolem Functions and Indescernables 


For any model 2 and any X C A there is a well-defined submodel of 2 generated 
by X (provided either X 4 9 or the language contains individual constants), that 
is a submodel 8 C 2 such that X C B and B C B’ for every B’ C A with X C B’. 
This is not in general true for elementary submodels, however. It would be true if 
21 had the property that B C 2 implies B ~ 2, since then the submodel generated 
by X would in fact be the elementary submodel generated by X. In this section 
we show that any theory T of a language £ can be expanded in a natural way to a 
theory T* in a larger language £* formed by adding “Skolem functions” such that 
all models of T* have the above stated property. 


DEFINITION 3.1. Let 2 be a model for some language £, and let X C A. 

(a) H(X), the hull of X in A, is the subset of A obtained by closing XU{c™ : ¢ in £} 
under F* for every F of L. 

(b) §(X) is the submodel of 2 whose universe = H(X), provided H(X) #4 0. 


Note that the model §(X) exists by the Lemma on page 74, when H(X) #4 0. 
The following Lemma is easily established. 
LEMMA 3.1. Given 2 and X C A: 

(1) |H(X)| < max(|X],|L)). 

(2) 9(X) is the smallest B C A with X C B, provided H(X) #0. 


Our goal is to establish the following: 
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THEOREM 3.2. Let T be a theory in a language L. Then there is a theory T* 
in a language L* such that 
(LOL RG T™, 
(wi) |C*|= || 
(iit) every model of T can be expanded to a model of T*. 
(iv) if W* ET* and B* C A* then B* ~ A*. 


The language £* and theory T* of this theorem will be defined as unions of 
chains {Lr}new, {In}new which “approximate” the property in (iv) more and more 
closely. We go from £,, and T,, to £y+41 and T,,41 by the following result. 


PROPOSITION 3.3. Let T be a theory of £. Then there is a theory T’ in L' such 
that: 
i £eCL GT CT 
(ii) |L'|= || 
(iti) every model of T can be expanded to a model of T’ 
(iv) if WV ET’ and B' CW’ then BSL ~KALL. 


Assuming the Proposition the Theorem easily follows: we define Lop = L, To = 
T, Lrai = (Ln)'s Trai = (Tn), £* = Une, £n and T* = Une. In. (i)-(iv) of the 
Theorem follow from the corresponding properties of the Proposition, making use 
of the elementary fact that every formula of £* will be a formula of some £L,, n € w, 


since it can contain just finitely many symbols. Thus, we now turn to the: 


PROOF OF THE PROPOSITION. L’ is the language which adds to £ a new n- 
place function symbol F3,, for every formula dyy(y, to,...,%n—1) of £ and a new 
constant symbol c3,, for every sentence dyyp(y) of L. 

We let X’ consist of all £-sentences of the following forms: 


syp(y) > 9(cayy) 


Vagrrrte: me ee] le Pe Ciel ea oes eo Peer eB 


for all formulas dyy of £. Finally, T’ = Cng/(T Ub’). The requirements of the 
Proposition are easily verified, using the Axiom of Choice for (iii) and the Lemma 
on page 77 for (iv). 


The functions (and constants) added to £ to obtain L’ are called Skolem func- 
tions for £, and the set ’ is the set of Skolem axioms for £L. Thus the language 
£* constructed in the proof of the Theorem has Skolem functions for £*, and 
T* =TU* where &* is the set of Skolem axioms for L*. 

A theory T’ having the property that whenever 2 — T’ and 8 C A then B ~ A 
is said to have Skolem functions, even if T’ is not constructed explicitly by adding 
“Skolem functions” to some theory in a smaller language. If 2 is a model of a 
theory having Skolem functions then §(X) ~ 2 for any X C A and H(X) is called 
the Skolem hull of X in 2. Also, X CY C A implies H(X) ~ H(Y). 

As a consequence of the existence of expansions with Skolem functions we note 
an improved Léwenheim-Skolem result. 


THEOREM 3.4. Given any 2, any X C A, and any infinite cardinal & such that 
max(|L], |X|) < « < |A| there is some 8 ~ A such that X C B and |X| =k. 
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Proor. Let T = Th(2) and let £*,T* be as in the Theorem, and let 2* be 
an expansion of 21 to a model of T*. We may suppose |X| = «. Thus |H*(X)| =« 
(since |£*| = |£| < «), and so B = H*(X) [CL as desired. 


In the remainder of this section we study Skolem hulls of indiscernible sets of 
elements, and derive several important consequences concerning models of arbitrary 
theories. 


DEFINITION 3.2. Let X C A and let < be a linear order of X. Then X isa 


set of indiscernibles (with respect to <) for 2 iff for all ai,...,an,a),...,a), EX 
such that a1 < ag < +--+ < Gp and ai < aj <--- < al, we have (,a1,...,an) = 
(A, a4,...,44) 


Note that the ordering of X is not assumed to be one of the relations of 21, or 
even definable in 2. Also nothing is assumed about the relative positions in the 
ordering of a,’s and ai,’s. Thus, if a, < ag < a3 < ag, all in X, indiscernibles for A, 
we have, e.g., 

(2, a1) = (A, a2) = (A, a3) = (A, as) 
(2, a1, a2) = (A, a1, a3) = (A, a1, a4) = (A, ae, a3) 
etc. However, one does not necessarily have (21, a1, a2) = (2, a4, a3). 

Probably the simplest non-trivial example of an infinite set of indiscernibles is 
X = A in any model 2 = (Q,<), where (in this case, though not in general) the 
ordering on X is <*. For another example, let £ have just a binary predicate FE, and 
let T' be the w-categorical theory asserting that E is an equivalence relation with 
infinitely many equivalence classes, all of them infinite. Then in any model 2 of T 
one can choose two radically different sets of indiscernibles, namely: X containing 
just elements from one equivalence class or Y containing only one element from each 
equivalence class. In either case the ordering is arbitrary. Note that 24 - 7~EG@1G 
for all ay,a2 € Y with a, 4 ag. 

The following, possibly surprising, result has many important consequences. 


THEOREM 3.5. Let T be a complete theory with infinite models. Let (B,<) be 
any linear ordering. Then there is some model A of T such that B C A and B is a 
set of indiscernibles for 2. 


Proor. We expand the language L of % to £(B) = LU {b: b € B}. Let d* 

be the result of adding to T all sentences of £(B) of the form 
(p(bi, «++ Bn)  G(BL,«--5Bp)) 

where y(21,...,2n) is a formula of £ and by <--: < by, bh <--- < U1, all from 
B. Any model of &* will be a model of T' with the required indiscernibles. We use 
compactness to show &* has a model. Let Uo C &* be finite. Then No contains 
only finitely many of the added sentences. Thus there is a finite Bp C B anda 
finite set ®y of £-formulas such that No is contained in T together iwth only the 
setences (y(b1,...,bn) © y(b;, sak 7) for yp € Bp and by < +--+ < bn, OL <--- < U, 
all in Bop. We may assume (by adding dummy variables) that all formulas in ®o 
have the same number of free variables, say n. Let 2 be any model of JT. We 
show how to interpret the constants in Bo as elements of A so as to obtain a model 
of No. First let < be any linear order of A. Let ag,...,@n,a},...,a), € A where 
a, < a2 <-+++ <a,» and a) <a <--:<aj,. We define {a1,...,an} ~ {a},...,a/,} 
to hold iff 2 E [p(@,...,Gn) @ v(@,...,G,)] for every y € &p. Then ~ is an 


eet) 
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equivalence relation on the collection Al”! of n-element subsets of A. Further (since 
®o is finite), this relation divides Al"! into finitely many equivalence classes. By 
a classical result of combinatorics, “Ramsey’s Theorem”, there must (since A is 
infinite) be an infinite set Ag C A such that all n-element subsets of Ag belong to 
the same equivalence class. Thus, if we interpret the (finitely many) elements of 
Bo, listed in increasing order, by any elements of Ao, also listed in increasing order, 
we obtain a model of No, as desired. 


Note that a set of indiscernibles X in 2 determines, for every n € w  {O}, a 
unique complete n-type ®(x1,...,2n,) which is the complete type of any increasing 
sequence of n elements of X in 2. If Y is a set of indiscernibles in %, then we 
say that X and Y have the same type, written tpy(X) = tpg(Y), iff X and Y 
determine exactly the same complete n-types for every n. The following result is 
easily established, and thus is left to the reader: 


THEOREM 3.6. Let X be an infinite set of indiscernibles in 2X and let (Y,<) 
be any infinite linear ordering. Then there is some 8 such that Y is a set of 
indiscernibles in B and tpy(X) = tp (Y). [In particular, A = B/. 


In theories with Skolem functions, the Skolem hulls of sets of indiscernibles are 
particularly well-behaved. The following summarises some of their most important 
properties. 


THEOREM 3.7. Let T be a complete theory (of £L) with Skolem functions. Let 
24,38 E T and suppose X,Y are infinite sets of indiscernibles in A,B respectively 
(with respect to <). Further, suppose that tpy(X) = tpg (Y). 

(1) H(X) and H(Y) are models of T realizing precisely the same n-types of L, for 
all n. 

(2) If h is an order-preserving map of X into Y then h extends to a unique iso- 
morphism h* of §(X) onto some elementary submodel of H(Y) — in fact, onto 
H({h(a):a€ X}). 

(3) If h is an order-preserving map of X onto itself, then h extends to a unique 
automorphism of §(X). 


Proor. First note that H(X) = {t7:t € Tmg,a1 < ag <-++ < dy all in X} 
and there is a corresponding representation for H(Y). Thus, let us suppose that 
§(X) realizes some 1-type ®(x). This means that for some t and a1 <-:: < Gy 
in X we have A, | v(t™(G,...,Gn)) for every y € ®. That is, v(t(x1,...,¢n)) 
belongs to the n-type of increasing sequences from X. Since we assumed that 
tpy(X) = tpy(Y) it follows that t?(b1,...,bn) realizes ®(x) in §(Y) for any 
by < bg <--- <b, in Y. The same argument for types in more than one variable 
establishes (1). 

In a similar way, the extension h* needed in (2) can be seen to be the map 
taking t™(a1,...,@n) to t?(h(a1),...,h(an)), for ay < ++ <@p in X. 

(3) follows immediately from (2). 


We can now establish the following very important result. 


THEOREM 3.8. Let T be a complete theory, in a countable language, having 
infinite models. Then there is a countable collection T of complete types (in all 
numbers of variables) such that for every K > w T has a model of power & which 
realizes precisely the complete types in T. 
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PROOF. We first choose an extension T* of T in £* as given by the Theorem 
on pg 120. Note that £* is still countable. Let X be a countably infinite set of 
indiscernibles in some model 2* of T*. Let T be the collection of complete types 
of £ realized in §(X)[£. Then T is countable since H*(X) is. Now given any 
& > w let (Y,<) be any linear ordering of cardinality «. By the Theorem on pg 
125, there is some model 8* = 2* such that Y is a set of indiscernibles in %8* and 
tp (Y) = tpg»(X). By Theorem (1) on pg 126, §*(Y) realizes precisely the same 
complete £*-types as 9*(X). In particular, §*(Y) [L£ is a model of T which realizes 
precisely the complete types in 7, and has power k. 


The reader should note that these are the first results which gives us any control 
over building uncountable models analogous to the control which the omitting types 
theorem yields over countable models. We are thus able to derive our first non- 
trivial result on the number of uncountable models of a theory. 


COROLLARY 3.9. Let T be a complete theory in a countable language, and 
assume there are 2” different complete types consistent with T. Then for every 
k >w T has at least 2” non-isomorphic models of cardinality k. 


Another important application of indiscernibles is the following: 


THEOREM 3.10. Let T be a complete theory, with infinite models, in a countable 
language. Then for every & > wT has a model of cardinality « with 2" automor- 
phisms. 


ProoF. As in the preceding Theorem we first expand T to a theory T* with 
Skolem functions. If (X,<) is a linear ordering of power « with 2“ automorphisms, 
and X is indiscernible in 8* — T*, then §*(X) [ZL is a model of T of cardinality « 
with 2” automorphisms, by Theorem (3) on page 126. 


4. Some Applications 


In this section we introduce no new techniques, but use previous material to 
derive some standard results, which are interesting both in themselves and as exam- 
ples of applications. Various further results in similar lines are mentioned briefly. 

Our first topic concerns results connecting the syntactical form of a sentence 
with properties of the class of models of the sentence. These are examples of 
preservation theorems. 


DEFINITION 4.1. (1) A sentence @ is a universal sentence (or, V-sentence) iff 6 
is Vao---Vana@ where a is open (i.e., has no quantifiers). 

(2) A set © os sentences if preserved under substructures iff for every A — &, if 
BCA then BE ¥. 


It is easily verified that any set of universal sentences is preserved under sub- 
structures. The theorem we are after states that the converse holds “up to equiva- 
lence”. More precisely: 


THEOREM 4.1. A theory T is preserved under substructures iff T = Cn(X) for 
some set & of universal sentences. 


We will derive this from a result stating when a model can be embedded in 
some model of T. 
The following notation is useful: 
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DEFINITION 4.2. For any theory T we define Ty = {V-sentences 0: T — 6}; in 
particular, we write Thy(2l) for (Th(2l))y = {V-sentences 0 : 2 — 0}. 


The Theorem on the preceding page is an immediate consequence (with © = Ty) 
of the following result: 


THEOREM 4.2. BE Ty iff B C A for some AE T. 


PrRooF. From right to left follows by our original remark that V-sentences are 
preserved under substructures. For the other direction, suppose 8 - Ty. To 
show that % can be embedded in some model of T is suffices (by the top Lemma 
on page 77) to show that TU Ag has a model. If not then, by compactness, 
T - 7a(bo,.--, bn) for some open £L(B)-sentence a(bo,..-, bn) true on 8. But then 
T & Vao-+-+Vain7a(20,-.--,2n), 80 V2o++:Vtn70(x,..-,2n) € Ty, a contradiction 
to our hypothesis that B — Ty. 


As a consequence, using compactness, we have the following: 


COROLLARY 4.3. {o} is preserved under substructures iff o -4 @ for some 
universal sentence @. 


The following is an important consequence of the above Theorem: 


CoroLiary 4.4. If B4 - Thy(2a4) then B C W’ for some A’ with A ~ A’ 
(where A CB). 


Note that a theory with Skolem functions is preserved under substructures, 
hence has a set of universal axioms. Similar results hold for preservation relative 
to another theory, where: 


DEFINITION 4.3. % is preserved under substructures relative to T iff for every 
AE TUN, ifBCAand BET then BE ™. 


Corresponding to the first Corollary on the preceding page we have, for exam- 
ple: 


THEOREM 4.5. {o} is preserved under substructures relative to T iffT E (ao 
0) for some V-sentence 0. 


We may also speak of a formula y(21,...,@n) being preserved, meaning that 
y(ci,---,Cn) is preserved where c),...,Cn are new constants. As a consequence of 
the preceding Theorem we thus obtain: 


COROLLARY 4.6. y(%) is preserved under substructures relative to T iff T — 
VE|p(Z) < @(%)| for some V-formula 6(£) (where the notion of V-formula has the 
obvious definition). 


Many other varieties of preservation results are known, for example concerning 
the following: 


DEFINITION 4.4. (1) A sentence 6 is an Vi-sentence iff @ is Vy +--+ Vandy1 ++: dyma(Z, 9) 
where a is open. 

(2) Sis preserved under unions of chains iff whenever {2l;};¢7 is a chain, under C, 
of models of © then U,<, %; F &. 


The corresponding preservation theorem is: 
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THEOREM 4.7. T is preserved under unions of chains iff T = Cn(X) for some 
set 4 of Va-sentences. 


The proof of the above Theorem, and the statements and proofs of its conse- 
quences and variations, are left to the reader. We also do not mention the numerous 
other preservation results concerning sentences preserved under homomorphism, di- 
rect products, etc. 

The next topic concerns theories for whose models substructure implies ele- 
mentary substructure. 


DEFINITION 4.5. T is model complete iff for all A,B — T if AC B then A ~ B. 


There are numerous natural examples of model complete theories, e.g. Th((Q, < 
)); certainly if T has Skolem functions then T is model complete but the converse 
fails. 

Note that a model complete theory need not be complete, but there is a simple 
sufficient condition for a model complete theory to be complete. 


LEMMA 4.8. Assume T is model complete and that there is some A T such 
that 2 can be embedded in every model of T. Then T is complete. 


ProoF. If 8 — T then 2 =~ %, hence 2 =~ 8 by hypothesis, so B = 2. 


Every theory can be expanded to a theory in a larger language which is model 
complete; more importantly, this expansion can be done in such a way (roughly, by 
adding only definable relations) that the new theory is “equivalent” to the old as 
far as most properties of their models are concerned. This was not the case with 
Skolem expansions. 


DEFINITION 4.6. Given any language £ we define L* as £L together with 
a new n-ary predicate symbol R, for every formula y(x1,...,%,) of L with n 
free variables. 
(1) &* = {Va[p(@) @ Ry(@)] : y() of L}. 


LEMMA 4.9. (1) |L| = |L*|. 
(2) Every L-structure A has exactly one expansion to an L*-structure A* | D*. 
(3) Cn(* UT) is model complete, for any theory T of L. 
(4) The following are equivalent: A < B,A* C BF, AF ~ B 
()AYB iffAat = Bre, 


We leave the proof of the Lemma to the reader. The only part requiring much 
argument is (3), for which one first proves (by induction) that for every formula 
w(#) of L* there is a formula y(%) of L such that U# E (w © yw), and hence 
BFL valu(z) & Rp(2)). 

The Lemma implies that we can replace any T’ by the model complete theory 
T# = Cn(TUS*) and not change, for example, the number of models in any power, 
on the relations of elementary embeddability between models. 

There is a simple characterization of model complete theories which is useful 
in determining whether a specific theory is model complete. 


n 


THEOREM 4.10. Given a theory T, the following are equivalent: 


(1) T is model complete, 
(2) For anyA, BET ifA CB then B4 & Thy(Ws) —i.e., whenever y(axo,..-,@n—1) 
is a universal formula of £L and 24 - ~p(G,.--,Gn—1) then Ba E v(Go,..-,Gn_1)- 


94 6. FURTHER TOPICS IN MODEL THEORY 


ProorF. (1) clearly implies (2). For the converse, suppose (2) holds. We define 
two elementary chains {2,}nc, and {B,z}new which “alternate” as follows: 

Let %q = A and Bo = B, where 2,8 — T are given and A C B. By (2), 
(Bo) a, FE Thy (2lo.4,), hence there is some 2; such that %o ~ lo and Bo C Ay — by 
the Corollary on the bottom of pg 130. Then, by (2) again, (24), F Thv(Boz,), 
hence there is some 8, such that 8g ~ 8B, and 2%, C %B,. Continuing in this way 
we obtain elementary chains {2 }new and {By}new such that 2%, C BC A,41 for 
all n. Thus we have A* = U¢,, An = Une, Bn = B*. But A ~ A* and B < B*, 


so we must have 2 ~ %, since 2 C B. 


In the next result, whose proof is left to the reader, we list two other conditions 
equivalent to being model complete — the first of which is the source of the name. 


THEOREM 4.11. The following are equivalent: 


(1) T is model complete, 

(2) For any AE T, (TUAg) ts a complete theory in L(A), 

(3) For every formula y(%) there is some universal formula 6(£) such that T 
(p + 6). 


Since a model complete theory is preserved under unions of chains, it must have 
a set of V4 axioms, by the Theorem on the bottom of page 131. Easy examples 
show that not all such theories are model complete, but the following result gives 
an interesting sufficient condition for model completeness. 


THEOREM 4.12. Assume T is preserved under unions of chains and that all 
uncountable models of T are w-saturated. Then T is model complete. 


It is known that if 7, in a countable L, is k-categorical for some & > w then 
all uncountable models of T’ are w-saturated. The converse has been conjectured 
— although this has been established for several important classes of theories, the 
question has still not been completely settled. 

This last theorem can be used to show that the theory of algebraically closed 
fields (in {+,-,0,1}) is model complete. 

Finally, we study the class of models of Th((w,<,0,S)), where s is the im- 
mediate successor function on w. This theory is an expansion by definitions of 
Th((w, <)) so all our results translate over to the theory in the smaller language, 
but we leave this translation to the reader. 

We will exhibit a finite set of axioms for Th((w,0,s)) and show this theory is 
model complete. This will enable us to determine the number of models in each 
cardinality of this theory and determine how they can be related under elementary 
embedding. 

Let ©% be the set of sentences (in <,0,s) saying that < is a linear order, 0 is 
the first element in the order, sx is the immediate successor in the order of x for 
any x, and 0 is the only element which does not have an immediate successor. 

Then every model of ©“ begins with a copy of w and is then followed by some 
number of copies of Z, linearly ordered in any way, and 0,s have their intended 
interpretations. Conversely, any such structure is a model of ©”. Note the following: 


LEMMA 4.13. Assume 2 / DY, a1,a2 € A and {a € A: ay <* a <*< ag} 
is infinite. Then A < B for some B such that {hE BN A: a, <® b <® ay} is 
infinite. 


95 


PRooF. % is any elementary extension of 2 realizing the type which says 
a@ < “,% < Gy, and there are infinitely many elements between @, and x and 
between x and @. 


We can now show that ©” is the desired set of axioms. 


THEOREM 4.14. (1) Cn(X”) is model complete. 
(2) Cn(E*) = Th((w, <,0, 8). 


ProorF. (1) Let 2,8 —- U”, 2 C B. By the Theorem on page 134 it suffices to 
show that B4 - Thy(24). So let a(zo,.-.-,%n—1, Yo,---;Y¥m—1) be an open formula 
of £ and let ao,...,@n—1 € A. We need to show that if B4 E Ayra(G,...,Gn—1,9) 
then also %4 F Ayra(G,...,Gn-1,9). So, let bo,...,bm—-1 € B be such that 
Be E -a(Go,...,Gn—1,b0,.--,bm—1). If b; ¢ A then b must be “infinitely far” (in 
the sense of <®) from every element of A. By repeated application of the preced- 
ing Lemma we obtain 2 ~< 2’ which has added elements in every “gap” inhabited 


by bo,...,bm—1. Thus, there are elements aj,...,a),, in A’ such that W,, - 
70(Go,.--,Gn—1, @, +--+; G1); Since all a can specify is the ordering and distances 
(finite) between elements. Thus in particular, %4 / Jyra(do,...,Gn—1,6) as de- 
sired. 


(2) follows from the Lemma on page 132, using 2% = (w, <,0, 2). 


What this tells us is the following: 


(a) every linear ordering (including the empty order) determines a model of Th((, < 
,0, s)) in which the copies of Z are so ordered; 

(b) non-isomorphic linear orderings determine non-isomorphic models of Th((w, < 
,0, 8)); 

(c) one model of Th((w, <,0,s)) can be elementarily embedded in another iff the 
linear ordering of the Z blocks in the first can be (isomorphically) embedded 
in the corresponding linear ordering in the second. 


Thus, by looking at appropriate linear orderings we derive results about the 
models of Th((w,0,s)) under elementary embedding. The main results we obtain 
are as follows: 

(1) for every & > w, Th((w, <,0,s)) has 2“ non-isomorphic models of cardinality 
K; 

Th((w, <,0,s)) has 2” non-isomorphic countable w)-universal models; 

Th((w, <,0,s)) has two countable models, realizing precisely the same types, 
neither of which can be elementarily embedded in the other. 

Note that it follows from [(1) and] (2) that Th((w, <, 0, s)) has 2” non-isomorphic 
countable models despite the fact that there are only countably many complete 
types consistent with Th((w, <,0,s)). 


>~—) 
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5. Exercises 


(1) Let £2” = {E} where E is a binary relation symbol. Let 2 be the countable 
L-structure in which E* is an equivalence relation on A with exactly one E™ 
class of size n for every positive integer n but with no infinite E™-classes. Define 
T = Th(Q). 

(a) Prove or disprove: T is model complete. 
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6. FURTHER TOPICS IN MODEL THI 


(b) Prove that 21 is a prime model of T. 
(c) Prove that T is not w-categorical. 


HORY 


APPENDIX A 


Appendix A: Set Theory 


The natural numbers (that is, non-negative integers) are used in two very dif- 
ferent ways. The first way is to count the number of elements in a (finite) set. The 
second way is to order the elements in a set-in this way one can prove things about 
all the elements in the set by induction. These two roles of natural numbers both 
are generalized to infinite numbers, but these are split into two groups according to 
the function they perform: cardinal numbers (to count) and ordinal numbers (to 
order). The basic facts and concepts are surveyed in this Appendix. 


1. Cardinals and Counting 


It was noted as long ago as Galileo that (some) infinite sets can be put into one- 
to-one correspondence with proper subsets of themselves, and thus, for example, 
that the set of integers may be considered as having the same “size” as the set 
of even integers. However no serious investigation into the “sizes” of infinite sets, 
on comparing them, was undertaken until Cantor, in the second half of the 19” 
century, created set theory, including both cardinal and ordinal numbers. 

The basic definitions about comparing the sizes of sets (including finite) are as 
follows: 


DEFINITION 1.1. (i) X ~ Y (X and Y are equivalent, or have the “same 
number” of elements) iff there is some function mapping X one-to-one onto Y. 
(ii) X x Y (X has “at most as many” elements as Y) iff there is some function 
mapping X one-to-one into Y. (iii) X ~ Y (X has strictly “fewer” elements than 
Y) iff X XY but not X.Y. 


The following proposition is certainly essential if one is to think of <x as some 
sort of ordering on sets. It was not proved, however, until the end of the 19*” 
century. 


PROPOSITION 1.1. If X XY andY XX thenX ~Y. 


All the basic facts about size comparisons could be expressed by the above 
notation but this would be quite clumsy. Instead certain sets, called cardinal num- 
bers, are picked out so that for every set X there is exactly one cardinal number « 
such that X ~ «. We then call « the cardinality of X and write |X| =. |X| < |Y| 
means X ~< Y and |X| < |Y| means X < Y. Notice that |«| = « if, and only if, « 
is a cardinal number. 

The first cardinal numbers are defined as follows: 
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nt+l=nU{n+1} 


w = {0,1,...,n,...} is defined as the smallest set containing 0 and such that 
if x ew then cU {a} Ew. 
Notice that w+1 = wU{w} cannot also be a cardinal number since w ~ wU{w}. 


DEFINITION 1.2. (a) X is finite iff X ~ n for some n € w (|X| Ew). (b) X is 
countable iff X < w (ie. |X| <w). 


LEMMA 1.2. (i) X is finite iff |X| <w. (ii) X is countable and infinite iff 
|X| =w. 

[The essential content of this lemma is that all cardinals less than w in fact 
belong to w]. 


One of Cantor’s fundamental discoveries is that there are infinite sets which 
are not equivalent, and in fact that there can be no biggest cardinal number. 


DEFINITION 1.3. The power set of X, is defined by P(X) = {Y| Y C X}. 
THEOREM 1.3. For ever X, X <~ P(X). 


PROOF. Obviously, X < P(X). Suppose that X ~ P(X), say that h maps 
X bijectively onto P(X). Let D = {x € X| « ¢ h(x)}. Then D = h(d) for some 
dé xX. But de D iff d € h(d) = D, which is a contradiction. 


Thus, there must be cardinals « such that w < «. We must put off defining 
them, however, until after we introduce ordinal numbers—we also should admit that 
we will need the Axiom of Choice (AC) to define cardinal numbers in general. 
Recall that AC states that if X is a set of non-empty sets, then there is a function 
f defined on X such that f(a) € x for every x € X. 

It is important to know that many set-theoretic operations lead from countable 
sets to countable sets. 


DEFINITION 1.4. (a) Yx is the set of all functions f with domain Y and range 


a subset of X. (b) ~X = Unew x is the set of all finite sequences of elements 
of X (thinking of a sequence of length n as a function defined on n); an alternate 
notation is 7 X. 


THEOREM 1.4. If X is countable then so is oO: 
Proor. It suffices to show that ~w ~< w, which follows by using the one-to-one 


map which sends (ko,...,kn—1) to 2*o+!.3hi+1... ptt: where p; is the j“" odd 
prime. 


COROLLARY 1.5. If X,Y are countable so are X UY and X x Y. 
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THEOREM 1.6. (AC) If X» is countable for every n €w then VU 
countable. 


new Xn 18 also 


w is, in fact, the the smallest infinite cardinal, although the proof requires the 
axiom of choice. 
PROPOSITION 1.7. (AC) If X is infinite then w x X. 


The analogues of + and - trivialize on infinite cardinals because of the preceding 
corollary, but exponentiation is important. 


NOTATION 3. If «,\ are cardinal numbers then x«* is the cardinal |A,,|. 
LEMMA 1.8. For any X, P(X) ~ Xo. 
Hence from Cantor’s Theorem we see the following corollary. 


COROLLARY 1.9. If |X| = « then |P(X)| = 2", and sow < 2" for every cardinal 


However, increasing the base does not yield still larger cardinals. 
LEMMA 1.10. 2% = n® = w” = (2”)%, anyn Ew. 
PRooF. It suffices to show (2”)” < 2”, which follows since 


Wwe) © (WX W)2 ~ We. 


Without proof we list some facts about (uncountable) cardinalities, all depend- 
ing on AC. 

(1) IfX is infinite then |X| = |X]. 

(2) If X,Y are infinite then |X UY| =|X x Y| = maz(|X|,|Y]). 

(3) If |Z| <« and |X;| < « for all i € J, then |U,-, Xi] < &, 
for kK > w. 

(4) (ut) = pmo), for K,A > w, p> 2. 

(5) For any cardinal « there is a unique nezt cardinal called «*, but 
there is no set X such that K < X ~ Kt. 

(6) If X is a non-empty set of cardinal numbers, then J X is a 
cardinal number and it is the first cardinal < all cardinals in _X. 

(7) («t)* = maz(«, K*) for 6, A > w. 

(8) For any sets X,Y either X x Y or Y = X, hence for any 
cardinals «, A either k <A or A < k. 

Some notation, based on (5), is the following which we will extend in the next 
section: wy = Wt, Wn41 = Wt, Wy = Unew Yn-writing also wo = w. An alternate 
notation is to use the Hebrew letter “aleph’—thus No, Ni,...,Xw,--- 

Note that w; < 2” and, in general, kt < 2* for each k > w. It is natural to 
enquire about whether equality holds or not. 
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CONJECTURE 1.1. Continuum Hypothesis (CH): 2% = w, 


CONJECTURE 1.2. Generalized Continuum Hypothesis (GCH): For every 


infinite cardinal K, 2% = KT. 
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CH and GCH are consistent with, but independent of, the usual axiioms of set 
theory. In fact, each of the following is consistent with the usual axioms: 


2° = wy, 2” =w2,2” =w, for anyn €w 
2° =i) o" = (Win)? ees 
We can, however, prove that 2” 4 uw, since we have (w,)” > ww. 
Some further facts about cardinals will be presented at the end of the next 
section. 


2. Ordinals and Induction 


The principles of proof by induction and definition by recursion on the natural 
numbers are consequences just of the fact that w is well-ordered by the usual order. 


DEFINITION 2.1. (X,<) is a well-ordering iff < is a linear order of X and every 
non-empty subset Y C X contains a least element, i.e. there is some ap € Y such 
that a9 <a forallacY. 


THEOREM 2.1. (Proof by Induction) Let (X,<) be a well-ordering. Let AC X 
have the property that for everya € X, ifb € A for allb <a thena € A. Then 
A=X. 


PrRooF. If not, consider Y = X —A and obtain a contradiction to the definition. 


The way this is used if one wants to prove that all elements of X have property 
P is to let A be the set of all elements of X having property P. 
In a similar vein, we see: 


THEOREM 2.2. (Definition by Recursion) Let (X,<) be a well-ordering. Let Y 
be any non-empty set and let g be a function from P(Y) into Y. Then there is a 
unique function f from X into Y such that for everya Ee X, 


f(a) = g({f(a)| « € X,x < a}). 


[Less formally, this just says that f(a) is defined in terms of the f(x)’s for 
xr<a. 

As in the previous section, we wish to pick out particular well-orderings, called 
ordinal numbers, such that each well-ordering is isomorphic to exactly one ordinal 
number. We do this so that the well order < of the ordinal is as natural as possible— 
that is, is give by €. The precise definition we obtain is as follows: 


DEFINITION 2.2. A set X is an ordinal number iff (i) «eye xX s>uvrexXx 
(equivalently, y € X > y C X), and (ii) X is well-ordered by the relation < defined 
by: ax biffacbora=b. 


Condition (i) is frequently expressed by saying “X is transitive” and condition 
(ii) is loosely expressed by saying “€ well-orders X.” Note that technically X is 
not a well-ordering, but (X,<) is-however condition (ii) determines < completely 
from X. Notice, of course, that most sets aren’t even linearly ordered by €-— 
in fact, one of the usual (but somewhat technical) axioms of set theory implies 
that if X is linearly ordered by €, then in fact it is well-ordered by €. Thus the 
conditions in (ii) could be expanded to read: (ii)*: « €y,yEz,2E€X S>a2 Ez, 
vuyeX >u=yVureyVye.s. («x ¢2x) follows by the usual axioms. 
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Notice that the finite cardinal numbers and w, as defined in the previous section, 
are also ordinal numbers. The following lemma gives some of the basic properties 
of ordinals. By convention, we normally use Greek letters a,(,... to stand for 
ordinals. 


LEMMA 2.3. (1) If a is an ordinal and x € a then x is an ordinal. (2) If a, 6 
are ordinals then either a € 8 ora = 8 or B € a. (3) If a is an ordinal then 
at+1=aU{a} is an ordinal. (4) If X is a set of ordinals then () X is an ordinal. 


NOTATION 4. If a, are ordinals we write a < G for a € 8. Part (1) of the 
lemma states that if a is an ordinal then a = {(| 6 is an ordinal, 8 < a}. The 
ordinal a + 1 is the immediate successor of a-that is, a < a+ 1 and there is no 
ordinal 6 such that a < 6 <a+1. Similarly, UJ X is the least upper bound of the 
set X of ordinals. 


The class of all ordinals is not a set, but we can still think of it as well-ordered 
by <. Further, we can prove things about the class of all ordinals by induction, 
and define functions on ordinals by recursion. 

Finally we note that ordinals do have the property for which we introduced 
them. 


THEOREM 2.4. Let (X,<) be a well-ordering. Then there is exactly one ordinal 
a such that (X,<) = (a, <). 


We distinguish between two types of non-zero ordinals as follows: 


DEFINITION 2.3. @ is a successor ordinal iff a = 6 +1 for some ordinal (£; a is 
a limit ordinal iff a 4 0 and a is not a successor ordinal. 


Note that a is a limit ordinal iff a 4 0 and Ua =a. If X is any non-empty 
set of ordinals not containing a largest ordinal, then LJ X is a limit ordinal. 

It is frequently more convenient to break proofs by induction, or definitions 
by recursion, into cases according to whether an ordinal is a successor or a limit 
ordinal. For example, the recursive definition of ordinal addition is as follows: 

if 6 =0 thena+8=a, 
if8=y+1thna+6=(a+y7)+1, 
if 6 is a limit then a+ 6B =U{at+ | 7 < 6}. 

While most linear orderings (X, <) are not well-orderings, there is no restriction 
on the sets X in well-orderings, by the next theorem. This means that proof by 
induction can (in principle) be applied to any set. 


THEOREM 2.5. (AC) For every set X there is some < which well-orders X. 
As an immediate consequence of the two preceeding theorems we have: 


COROLLARY 2.6. (AC) For every set X there is some ordinal a@ such that 
X~a. 


The ordinal a is not unique unless a < w, since ifw <a thena~a-+l1, but 
the least such ordinal will be the cardinality of X. 


DEFINITION 2.4. « is a cardinal number iff « is an ordinal number and for every 
a < & we have a ~ « (equivalently, for every ordinal a such that a ~ & we have 
kK <a). 
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This fills the lacuna in the preceding section. Note that the cardinal numbers 
are well-ordered by <, and < is € on cardinal numbers. 
The way we will customarily use proof by induction on an arbitrary set X is 
as follows: let |X| = « so there is some one-to-one function h mapping « onto X. 
Write x for h(a). Then X = {| a < «} and we prove what we want about rq 
by induction on a < «. Note that for each a < & we have |{xg| 8 < a}|=a<k. 
The class of infinite cardinals can be indexed by the class of ordinals by using 
the following definition by recursion: 
w(0) =w, 
w(y +1) = (w(y))*, 
Ba limit > w(3) = U{wl7)| 7 < By. 
We normally write w, instead of w(y). 
We finally need to introduce the concept of cofinality in order to make the 
important distinction between regular and singular cardinals. 


DEFINITION 2.5. Let a, @ be limit ordinals. Then a is cofinal in @ iff there is 
a strictly increasing function f € ag such that U{f(7)| y < a} = 8. 


Certainly @ is confinal in (. w is cofinal in every countable limit ordinal, but 
w is not cofinal in w. 


DEFINITION 2.6. Let 6 be a limit ordinal. Then the cofinality of 8 is cf(G) 
equals the least a such that a is confinal in £. 


LEMMA 2.7. For any limit ordinal 6, cf (8) < 8 and cf(8) ts a cardinal. 


DEFINITION 2.7. Let « be an infinite cardinal. Then « is regular iff « = cf(k). 
« is singular iff cf(K) < K. 


DEFINITION 2.8. « is a successor cardinal iff « = A+ for some cardinal 4, i.e. 
kK = wg41 for some fp. 


DEFINITION 2.9. « is a limit cardinal iff « > w and « is not a successor cardinal, 
L.€., & = We for some limit ordinal a. 


The division of infinite cardinals into regular and singular is almost the same 
as the division into successor and limit. 


THEOREM 2.8. (1) Every successor cardinal is regular. (2) if & = we is a limit 
cardinal, then cf («) = cf(a)-hence if & is regular then K = wy. 


Regular limit cardinals are called inaccessible cardinals—their existence cannot 
be proved from the usual axioms of set theory. 

With cofinalities we can state a few more laws of cardinal computation, con- 
tinuing the list from the previous section. 

(9) «°f() > « for every cardinal K > w. 
(10) Assume that |J| < cf(«) and for every 7 € I, |Xi| < k. 
Then | U;e7 Xi| < &. 

It is frequently tempting to assume GCH because it simplifies many computa- 
tions, e.g.: Assuming GCH we have, for any cardinals k, A > w, K* = K if \ < cf(k), 
RS ATP ef KR) AS SHAT IPRS: 
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Appendix B: Notes on Validities and Logical 
Consequence 


1. Some Useful Validities of Sentential Logic 
1) Excluded Middle 


FOV 76 
F 7(¢ A 74) 


2) Modus Ponens 
bO70FY 


3) Conjunction 


YR ory 


4) Transitivity of Implication 
e7v,~y~rdrFord 


5) Plain Ol True as Day 


oF evVy 
o> (p> 8), ¢74F O78 


6) Proof by Contradiction 
79 > (PAY) Fo 
“bbb SO 


7) Proof by Cases 
d$>V,0>4,F (V9) yu 
o>,-670FY 


8) De Morgan’s Laws 
Ad Vb) F476 Amp 
AdA W) F476 V wy 


9) Distributive Laws 
PA(WVO)FA(PAY) V (GAB) 
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B. APPENDIX B: NOTES ON VALIDITIES AND LOGICAL CONSEQUENCE 


PV (PAO) FA(PVY) A (OVA) 


10) Contraposition 


> pra = > 7 


11) The connectives A and V are both commutative and associative. 


1) NU {o} 


2) If = E ¢; for each i =1,...,n and {q),... 


2. Some Facts About Logical Consequence 


LK ap iff D 


EGY 


3) XE 6 iff NU {7¢} is not satisfiable. 


son} 


Ky then DE v. 
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Appendix C: Gothic Alphabet 


a Aa b Bb c Ec d Do 
e €e f£ Sf g Gg h Hh 
i vi j FY k Re 1 Lt 
m Vin n Mn o Yo p Pp 
q Qq vr Rr s Gs t Ft 
u Lu v Vo w Wo x Xe 
y Yo z 33 
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ordinals, 100 


power set, 98 
prime, 74 
principal, 74 


quantifiers, 24 


realize, 61 
locally, 63 
recursion, 70 
definition by, 12, 18, 64, 100 
reduct, 29 
regular cardinal, 102 
rule, 37 


satisfiability, 13, 31, 33, 34, 67 
finite, 62 
sentence, 8, 24 
singular cardinal, 102 
soundness, 38 
structure, 28 
canonical, 50 
submodel, 66 
elementary, 66 
substitution, 32 
successor, 26, 101 
symbol, 8, 23 
constant, 24 
function, 24 
logical, 24 
non-logical, 24 
relation, 24 
syntax, 8 


tautology, 14, 35, 38 
terms, 25 
theory, 43 

complete, 43 
truth assignment, 11 
truth table, 15 
type, 61 

complete, 65, 74 
types 

omitting, 66 


unique readability, 9, 25 
universe, 44 


valid, 38 
validity, 14, 19, 33 
variable, 24 

free, 27 


well-ordering, 100 
witness, 50 


